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MAMMALIAN CYTOKINE-LIKE RECEPTOR 

5 

BACKGROUND OF THE INVENTION 

Proliferation and differentiation of cells of 
multicellular organisms are controlled by hormones and 

10 polypeptide growth factors. These diff usable molecules 
allow cells to communicate with each other and act in 
concert to form cells and organs, and to repair and 
regenerate damaged tissue. Examples of hormones and 
growth factors include the steroid hormones (e.g. 

15 estrogen, testosterone) , parathyroid hormone, follicle 
stimulating hormone, the interleukins, platelet derived 
growth factor (PDGF) , epidermal growth factor (EGF) , 
granulocyte -macrophage colony stimulating factor (GM-CSF) , 
erythropoietin (EPO) and calcitonin. 

20 

Hormones and growth factors influence cellular 
metabolism by binding to receptors. Receptors may be 
integral membrane proteins that are linked to signaling 
pathways within the cell, such as second messenger 
25 systems. Other classes of receptors are soluble 
molecules, such as the transcription factors. 

Of particular interest are receptors for 
cytokines, molecules that promote the proliferation and/or 

30 differentiation of cells. Examples of cytokines include 
erythropoietin (EPO) , which stimulates the development of 
red blood cells; thrombopoietin (TPO) , which stimulates 
development of cells of the megakaryocyte lineage; and 
granulocyte -colony stimulating factor (G-CSF) , which 

35 stimulates development of neutrophils. These cytokines 
are useful in restoring normal blood cell levels in 
patients suffering from anemia or receiving chemotherapy 
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for cancer. The demonstrated in vivo activities of these 
cytokines illustrates the enormous clinical potential of, 
and need for, other cytokines, cytokine agonists, and 
cytokine antagonists. 

5 

SUMMARY OF THE INVENTION 

The present invention addresses this need by 
providing a novel mammalian cytokine-like receptor called 

10 mammalian ZcytorS, and related compositions and methods. 
Within one aspect, the present invention provides an 
isolated human polynucleotide encoding a ligand-binding 
human receptor polypeptide. The polypeptide comprises a 
sequence of amino acids containing (a) the amino acid 

15 residues of SEQ ID NO: 17, residues 35 to 422 of SEQ ID 
NO:2; (b) allelic variants of (a); and (c) sequences that 
are at least 90%, 95% or 99% identical to (a) or (b) . In 
an alternative embodiment, the polypeptide is comprised of 
amino acid residues 3 0 to and including amino acid residue 

20 422 of SEQ ID NO: 2. 

The present invention also provides for a 
polynucleotide encoding another allelic variant of SEQ ID 
NO: 2 which is a human polypeptide receptor and is defined 

25 by SEQ ID NO: 4 in particular the polypeptide comprised of 
a sequence of amino acids containing (a) the amino acid 
residues of SEQ ID NO: 18, residues 34 to 425 of SEQ ID 
N0:4; (b) allelic variants of (a); and (c) sequences that 
are at least 90%, 95% or 99% identical to (a) or (b) . In 

30 an alternative embodiment, the polypeptide is comprised of 
amino acid residues 29 to and including amino acid residue 
425 of SEQ ID NO:4. 

Other polynucleotides of the present invention 
35 encode the amino acid sequence of SEQ ID NO: 21 which is a 
soluble receptor of SEQ ID NO: 17 that does not contain a 
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C-terminus phosphatidylinositol signal sequence; the amino 
acid sequence of SEQ ID NO: 20 is a ZcytorS polypeptide of 
SEQ ID NO: 2 having an alternative N-terminus cleavage 
site; SEQ ID NO: 22 which. has an alternative N-terminus 

5 cleavage site of the ZcytorS polypeptide of SEQ ID NO: 4; 
SEQ ID NO: 23 which is an amino acid of SEQ ID NO: 18 that 
does not contain a C-terminus phosphatidylinositol signal 
sequence and the amino acid sequences defined by SEQ ID 
NOs: 24-31 which are variants of the ZcytorS polypeptide 

10 of SEQ ID NO:4. 



Another embodiment of the present invention is a 
polynucleotide which encodes rat ZcytorS. In particular, a 
polynucleotide is claimed which encodes a rat polypeptide 
15 containing (a) the amino acid sequence of SEQ ID NO: 19 

residues 41 to 425 of SEQ ID NO: 6; (b) allelic variants of 
(a) ; and (c) sequences that are at least 90%, 95% or 99% 
identical to (a) or (b) . 



20 Within a second aspect of the invention there is 

provided an expression vector comprising (a) a 
transcription promoter; (b) a DNA segment encoding a 
secretory peptide and a ligand-binding ZcyotorS receptor 
polypeptide, containing an amino acid sequence as 

25 described above. 



Within a third aspect of the invention there is 
provided a cultured eukaryotic cell into which has been 
introduced an expression vector as disclosed above, 
wherein said cell expresses a mammalian ZcytorS receptor 
polypeptide encoded by the DNA segment. 

Within a fourth aspect of the invention there is 
provided an isolated polypeptide. The polypeptide 
comprises a sequence of amino acids containing (a) the 
amino acid sequence of SEQ ID NO: 17, residues 35 to 422 
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of SEQ ID NO:2; (b) allelic variants of (a); and (c) 
sequences that are at least 90%, 95% or 99% identical to 
(a) or (b) . In an alternative embodiment, the polypeptide 
is comprised of amino acid residues 3 0 to and including 
5 amino acid residue 422 of SEQ ID NO: 2. 

The present invention also provides for another 
allelic variant of SEQ ID NO: 2 which is a human 
polypeptide receptor and is defined by SEQ ID NO: 4 in 
particular the polypeptide is comprised of a sequence of 
amino acids containing (a) the amino acid sequence of SEQ 
ID NO: 18, residues 34 to 425 of SEQ ID NO:4; (b) allelic 
variants of (a); and (c) sequences that are at least 90%, 
95% or 99% identical to (a) or (b) . In an alternative 
embodiment, the polypeptide is comprised of residues 29 to 
425 of SEQ ID NO: 4. 

Another embodiment of the present invention is a 
rat ZcytorS polypeptide containing (a) the amino acid 
20 sequence of SEQ ID NO: 19, residues 41 to 425 of SEQ ID 
NO:6; (b) allelic variants of (a); and (c) sequences that 
are at least 80% identical to (a) or (b) . 

Within a further aspect of the invention there 
is provided a chimeric polypeptide consisting essentially 
of a first portion and a second portion joined by a 
peptide bond. The first portion of the chimeric 
polypeptide consists essentially of a ZcytorS receptor 
polypeptide as described above. The invention also 
provides expression vectors encoding the chimeric 
polypeptides and host cells transfected to produce the 
chimeric polypeptides. 

The invention also provides a method for 
35 detecting a ligand within a test sample, comprising 

contacting a test sample with a ZcytorS polypeptide as 
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disclosed above, and detecting binding of the polypeptide 
to ligand in the sample. The polypeptide can be membrane 
bound within a cultured cell, wherein the detecting step 
comprises measuring a biological response in the cultured 
cell. Within another embodiment, the polypeptide is 
immobilized on a solid support. 

Within an additional aspect of the invention 
there is provided an antibody that specifically binds to a 
polypeptide as disclosed above and an anti-idiotypic 
antibody of an antibody which specifically binds to a 
ZcytorS antibody, also a method for producing an antibody 
to ZcytorS. 

An additional embodiment of the present 
invention relates to a peptide or polypeptide which has 
the amino acid sequence of an epitope-bearing portion of a 
ZcytorS polypeptide having an amino acid sequence 
described above. Peptides or polypeptides having the amino 
acid sequence of an epitope-bearing portion of a ZcytorS 
polypeptide of the present invention include portions of 
such polypeptides with at least nine, preferably at least 
15 and more preferably at least 3 0 to 50 amino acids, 
although epitope-bearing polypeptides of any length up to 
and including the entire amino acid sequence of a 
polypeptide of the present invention described above are 
also included in the present invention. Examples of said 
polypeptides are defined by the amino acid sequences of 
SEQ ID NOs: 32-37. 



DETAILED DESCRIPTION OF THE INVENTION 

The teachings of all of the references cited in 
the present specification are incorporated in their 
entirety herein by reference. 
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The term "affinity tag" is used herein to denote 
a polypeptide segment that can be attached to a second 
polypeptide to provide for purification or detection of 
the second polypeptide or provide sites for attachment of 
the second polypeptide to a substrate. In principal, any 
peptide or protein for which an antibody or other specific 
binding agent is available can be used as an affinity tag. 
Affinity tags include a poly-histidine tract, protein A 

[Nilsson et al, EMBO J . 4:1075 (1985); Nilsson et al . , 
Methods Enzymol. 198:3 (1991)], glutathione S transferase 

[Smith and Johnson, Gene 67:31 (1988)], Glu-Glu affinity 
tag [Grussenmeyer et al., Proc. Natl. Acad. Sci . USA 
52:7952-4 (1985)], substance P, FLAG™ peptide (Hopp et 
al., Biotechnology 6: 1204 -10 (1988), streptavidin binding 
peptide, or other antigenic epitope or binding domain. 
See, in general, Ford et al . , Protein Expression and 
Purification 2: 95-107 (1991) . DNAs encoding affinity 
tags are available from commercial suppliers (e.g., 
Pharmacia Biotech, Piscataway, NJ) . 

The term "allelic variant" denotes any of two or 
more alternative forms of a gene occupying the same 
chromosomal locus. Allelic variation arises naturally 
through mutation, and may result in phenotypic 
polymorphism within populations. Gene mutations can be 
silent (no change in the encoded polypeptide) or may 
encode polypeptides having altered amino acid sequence. 
The term allelic variant is also used herein to denote a 
protein encoded by an allelic variant of a gene. 

The term "allelic variant" is used herein to 
denote any of two or more alternative forms of a gene 
occupying the same chromosomal locus. Allelic variation 
arises naturally through mutation, and may result in 
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phenotypic polymorphism within populations. Gene 
mutations can be silent (no change in the encoded 
polypeptide) or may encode polypeptides having altered 
amino acid sequence. The term allelic variant is also 
5 used herein to denote a protein encoded by an allelic 
variant of a gene. 

The terms "amino-terminal" and "carboxyl- 
terminal" are used herein to denote positions within 
polypeptides. Where the context allows, these terms are 
used with reference to a particular sequence or portion of 
a polypeptide to denote proximity or relative position. 
For example, a certain sequence positioned carboxyl - 
terminal to a reference sequence within a polypeptide is 
located proximal to the carboxyl terminus of the reference 
sequence, but is not necessarily at the carboxyl terminus 
of the complete polypeptide. 

The term "complement/anti-complement pair" 
denotes non- identical moieties that form a non-covalently 
associated, stable pair under appropriate conditions. For 
instance, biotin and avidin (or streptavidin) are 
prototypical members of a complement /ant i- complement pair. 
Other exemplary complement/anti-complement pairs include 
receptor/ligand pairs, antibody /antigen (or hapten or 
epitope) pairs, sense/antisense polynucleotide pairs, and 
the like. Where subsequent dissociation of the 
complement/anti-complement pair is desirable, the 
complement/anti-complement pair preferably has a binding 
affinity of <10 9 NT 1 . 

The term "complements of a polynucleotide 
molecule" is a polynucleotide molecule having a 
complementary base sequence and reverse orientation as 
35 compared to a reference sequence. For example, the 
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sequence 5* ATGCACGGG 3' is complementary to 5' CCCGTGCAT 
3 1 . 

The term "contig" denotes a polynucleotide that 
5 has a contiguous stretch of identical or complementary 
sequence to another polynucleotide. Contiguous sequences 
are said to "overlap" a given stretch of polynucleotide 
sequence either in their entirety or alone a partial 
stretch of the polynucleotide. For example, 
10 representative contigs to the polynucleotide sequence 5 1 - 
ATGGCTTAGCTT-3 1 are 5 ' -TAGCTTgagtct-3 ' and 3 » -gtcgacTACCGA- 
5 ! . 

The term "degenerate nucleotide sequence" 
15 denotes a sequence of nucleotides that includes one or 
more degenerate codons (as compared to a reference 
polynucleotide molecule that encodes a polypeptide) . 
Degenerate codons contain different triplets of 
nucleotides, but encode the same amino acid residue (i.e., 
20 GAU and GAC triplets each encode Asp) . 



The term "expression vector" is used to denote a 
25 DNA molecule, linear or circular, that comprises a segment 
encoding a polypeptide of interest operably linked to 
additional segments that provide for its transcription. 
Such additional segments include promoter and terminator 
sequences, and may also include one or more origins of 
30 replication, one or more selectable markers, an enhancer, 
a polyadenylation signal, etc. Expression vectors are 
generally derived from plasmid or viral DNA, or may 
contain elements of both. 



35 



The term "isolated", when applied to a 
polynucleotide, denotes that the polynucleotide has been 
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removed from its natural genetic milieu and is thus free 
of other extraneous or unwanted coding sequences, and is 
in a form suitable for use within genetically engineered 
protein production systems. Such isolated molecules are 
5 those that are separated from their natural environment 
and include cDNA and genomic clones. Isolated DNA 
molecules of the present invention are free of other genes 
with which they are ordinarily associated, but may include 
naturally occurring 5 ' and 3 ' untranslated regions such as 
10 promoters and terminators. The identification of 

associated regions will be evident to one of ordinary 
skill in the art, for example, Dynan and Tijan, Nature 
316:774-78 (1985) . 

15 An "isolated" polypeptide or protein is a 

polypeptide or protein that is found in a condition other 
than its native environment, such as apart from blood and 
animal tissue. In a preferred form, the isolated 
polypeptide is substantially free of other polypeptides, 

20 particularly other polypeptides of animal origin. It is 
preferred to provide the polypeptides in a highly purified 
form, i.e. greater than 95% pure, more preferably greater 
than 99% pure. When used in this context, the term 
"isolated" does not exclude the presence of the same 

25 polypeptide in alternative physical forms, such as dimers 
or alternatively glycosylated or derivatized forms. 

"Operably linked", when referring to DNA 
segments, indicates that the segments are arranged. so that 
30 they function in concert for their intended purposes, e.g. 
transcription initiates in the promoter and proceeds 
through the coding segment to the terminator. 

The term "ortholog" (or "species homolog") 
35 denotes a polypeptide or protein obtained from one species 
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that has homology to an analogous polypeptide or protein 
from a different species. 

The term "paralog" denotes a polypeptide or 
5 protein obtained from a given species that has homology to 
a distinct polypeptide or protein from that same species. 

A "polynucleotide" is a single- or double- 
stranded polymer of deoxyribonucleotide or ribonucleotide 
10 bases read from the 5' to the 3' end. Polynucleotides 
include RNA and DNA, and may be isolated from natural 
sources, synthesized in vitro, or prepared from a 
combination of natural and synthetic molecules. 

15 A "polypeptide" is a polymer of amino acid 

residues joined by peptide bonds, whether produced 
naturally or synthetically. Polypeptides of less than 
about 10 amino acid residues are commonly referred to as 
"peptides" . 

20 

The term "promoter" is used herein for its art- 
recognized meaning to denote a portion of a gene 
containing DNA sequences that provide for the binding of 
RNA polymerase and initiation of transcription. Promoter 
25 sequences are commonly, but not always, found in the 5 ' 
non- coding regions of genes. 

A "protein" is a macromolecule comprising one or 
more polypeptide chains. A protein may also comprise non- 
30 peptidic components, such as carbohydrate groups. 
Carbohydrates and other non-peptidic substituents may be 
added to a protein by the cell in which the protein is 
produced, and will vary with the type of cell. Proteins 
are defined herein in terms of their amino acid backbone 
35 structures; substituents such as carbohydrate groups are 
generally not specified, but may be present nonetheless. 
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The term "receptor" denotes a cell-associated 
protein that binds to a bioactive molecule (i.e., a 
ligand) and mediates the effect of the ligand on the cell. 
5 Membrane -bound receptors are characterized by a multi- 
domain structure comprising an extracellular ligand- 
binding domain and an intracellular effector domain that 
is typically involved in signal transduction. Binding of 
ligand to receptor results in a conformational change in 

10 the receptor that causes an interaction between the 
effector domain and other molecule (s) in the cell. This 
interaction in turn leads to an alteration in the 
metabolism of the cell. Metabolic events that are linked 
to receptor-ligand interactions include gene 

15 transcription, phosphorylation, dephosphorylation, 

increases in cyclic AMP production, mobilization of 
cellular calcium, mobilization of membrane lipids, cell 
adhesion, hydrolysis of inositol lipids and hydrolysis of 
phospholipids. In general, receptors can be membrane 

20 bound, cytosolic or nuclear; monomeric (e.g., thyroid 
stimulating hormone receptor, beta-adrenergic receptor) or 
multimeric (e.g., PDGF receptor, growth hormone receptor, 
IL-3 receptor, GM-CSF receptor, G-CSF receptor, 
erythropoietin receptor and IL-6 receptor) . 

25 

The term "secretory signal sequence" denotes a 
DNA sequence that encodes a polypeptide (a "secretory 
peptide") that, as a component of a larger polypeptide, 
directs the larger polypeptide through a secretory pathway 
30 of a cell in which it is synthesized. The larger 
polypeptide is commonly cleaved to remove the secretory 
peptide during transit through the secretory pathway. 

The term "splice variant" is used herein to 
35 denote alternative forms of RNA transcribed from a gene. 
Splice variation arises naturally through use of 
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alternative splicing sites within a transcribed RNA 
molecule, or less commonly between separately transcribed 
RNA molecules, and may result in several mRNAs transcribed 
from the same gene. Splice variants may encode 

5 polypeptides having altered amino acid sequence. The term 
splice variant is also used herein to denote a protein 
encoded by a splice variant of an mRNA transcribed from a 
gene . 

10 A "soluble receptor" is a receptor polypeptide 

that is not bound to a cell membrane. Soluble receptors 
are most commonly ligand-binding receptor polypeptides 
that lack transmembrane and cytoplasmic domains. Soluble 
receptors can comprise additional amino acid residues, 

15 such as affinity tags that provide for purification of the 
polypeptide or provide sites for attachment of the 
polypeptide to a substrate, or immunoglobulin constant 
region sequences. Many cell-surface receptors have 
naturally occurring, soluble counterparts that are 

20 produced by proteolysis or translated from alternatively 
spliced mRNAs. Receptor polypeptides are said to be 
substantially free of transmembrane and intracellular 
polypeptide segments when they lack sufficient portions of 
these segments to provide membrane anchoring or signal 

25 transduction, respectively. 

The present invention is based in part upon the 
discovery of a novel DNA sequence that encodes a protein 
having the structure of a cytokine receptor, including the 
30 conserved WSXWS motif. Analysis of the tissue 

distribution of the mRNA corresponding to this novel DNA 
showed that expression was present in highest amounts in 
placenta, thyroid, heart and skeletal muscle with lower 
levels in prostate and trachea. 



35 
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Cytokine receptors subunits are characterized by 
a multi-domain structure comprising a ligand-binding 
domain and an effector domain that is typically involved 
in signal transduction. Multimeric cytokine receptors 
5 include homodimers (e.g., PDGF receptor act and pp 

isoforms, erythropoietin receptor, MPL [thrombopoietin 
receptor] , and G-CSF receptor) , heterodimers whose 
subunits each have ligand-binding and effector domains 
(e.g., PDGF receptor ctP isof orm) , and multimers having 

10 component subunits with disparate functions (e.g., IL-2, 
IL-3, IL-4, IL-5, IL-6, IL-7, and GM-CSF receptors) . Some 
receptor subunits are common to a plurality of receptors . 
For example, the AIC2B subunit, which cannot bind ligand 
on its own but includes an intracellular signal 

15 transduction domain, is a component of IL-3 and GM-CSF 

receptors . Many cytokine receptors can be placed into one 
of four related families on the basis of their structures 
and functions. Hematopoietic receptors, for example, are 
characterized by the presence of a domain containing 

20 conserved cysteine residues and the WSXWS motif. 

Additional domains, including protein kinase domains; 
fibronectin type III domains; and immunoglobulin domains, 
which are characterized by disulf ide-bonded loops, are 
present in certain hematopoietic receptors. Cytokine 

25 receptor structure has been reviewed by Urdal, Ann. 

Reports Med. Chem. 2^:221-228 (1991) and Cosman, Cytokine 
5:95-106 (1993) . It is generally believed that under 
selective pressure for organisms to acquire new biological 
functions, new receptor family members arose from 

30 duplication of existing receptor genes leading to the 
existence of multi-gene families. Family members thus 
contain vestiges of the ancestral gene, and these 
characteristic features can be exploited in the isolation 
and identification of additional family members. The 

35 cytokine receptor superfamily is subdivided as shown in 
Table 1. 
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Table 1 

Cytokine Receptor Superfamilv 

Immunog lobu 1 i n f ami ly 

CSF-1 receptor 

MGF receptor 

IL-1 receptor 

PDGF receptor 
Hematopoietin family 

erythropoietin receptor 

G-CSF receptor 

IL-2 receptor b-subunit 

IL-3 receptor 

IL-4 receptor 

IL-5 receptor 

IL-6 receptor 

IL-7 receptor 

IL-9 receptor 

GM-CSF receptor a-subunit 

GM-CSF receptor b-subunit 

Prolactin receptor 
CNTF receptor 

Oncostatin M receptor 

Leukemia inhibitory factor receptor 

Growth hormone receptor 

MPL 

Leptin receptor 
TNF receptor family 

TNF (p80) receptor 

TNF (p60) receptor 

TNFR-RP 

CD27 

CD30 

CD40 

4 -IBB 
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Table 1. continued 
OX-4 0 
Fas 

NGF receptor 

5 Other 

IL-2 receptor a-subunit 
IL-15 receptor a-subunit 
IFN-y receptor 

Cell -surface cytokine receptors are further 
characterized by the presence of additional domains. 
These receptors are anchored in the cell membrane by a 
transmembrane domain characterized by a sequence of 
hydrophobic amino acid residues (typically about 21-25 
residues) , which is commonly flanked by positively charged 
residues (Lys or Arg) . On the opposite end of the protein 
from the extracellular domain and separated from it by the 
transmembrane domain is an intracellular domain. 

The novel receptor of the present invention was 
initially identified by the presence of the conserved 
WSXWS motif. Analysis of a human cDNA clone encoding 
human ZcytorS (SEQ ID NO:l) revealed an open reading frame 
encoding 422 amino acids (SEQ ID NO: 2) or an allelic 
variant reveals an open reading of 425 amino acid 
residues, SEQ ID NO: 3 and SEQ ID NO:4. 

Within preferred embodiments of the invention 
the isolated polynucleotides will hybridize to similar 
30 sized regions of SEQ ID NO:l, SEQ ID NO: 3, or SEQ ID NO: 5, 
or a sequence complementary thereto, under stringent 
conditions. In general, stringent conditions are selected 
to be about 5°C lower than the thermal melting point (T m ) 
for the specific sequence at a defined ionic strength and 
35 pH. The T m is the temperature (under defined ionic 
strength and pH) at which 50% of the target sequence 
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hybridizes to a perfectly matched probe. Typical 
stringent conditions are those in which the salt 
concentration is at least about 0.02 M at pH 7 and the 
temperature is at least about 60°C. As previously noted, 
5 the isolated polynucleotides of the present invention 
include DNA and RNA. Methods for isolating DNA and RNA 
are well known in the art. It is generally preferred to 
isolate RNA from testis, including whole testis tissue 
extracts or testicular cells, such as Sertoli cells, 

10 Leydig cells, spermatogonia, or epididymis, although DNA 
can also be prepared using RNA from other tissues or 
isolated as genomic DNA. Total RNA can be prepared using 
guanidine HC1 extraction followed by isolation by 
centrifugation in a CsCl gradient [Chirgwin et al., 

15 Biochemistry 15:52-94 (1979)]. Poly (A) + RNA is prepared 
from total RNA using the method of Aviv and Leder, Proc. 
Natl. Acad. Sci . USA £.9:1408-1412 (1972). Complementary 
DNA (cDNA) is prepared from poly (A) + RNA using known 
methods. Polynucleotides encoding ZcytorS polypeptides 

20 are then identified and isolated by, for example, 
hybridization or PGR. 



Those skilled in the art will recognize that the 
25 sequences disclosed in SEQ ID NOS:l, 2, 3,4 represent 

single alleles of the human and SEQ ID NOs 5 and 6 of the 
rat Zcytor5 receptors. Allelic variants of these 
sequences can be cloned by probing cDNA or genomic 
libraries from different individuals according to standard 
30 procedures . 

The present invention further provides 
counterpart receptors and polynucleotides from other 
species ("species orthologs") . Of particular interest are 
35 ZcytorS receptors from other mammalian species, including 
murine, porcine, ovine, bovine, canine, feline, equine, 
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and other primate receptors. Species orthologs of the 
human and macaque ZcytorS receptors can be cloned using 
information and compositions provided by the present 
invention in combination with conventional cloning 
5 techniques. For example, a cDNA can be cloned using mRNA 
obtained from a tissue or cell type that expresses the 
receptor. Suitable sources of mRNA can be identified by 
probing Northern blots with probes designed from the 
sequences disclosed herein. A library is then prepared 

10 from mRNA of a positive tissue or cell line. A receptor- 
encoding cDNA can then be isolated by a variety of 
methods, such as by probing with a complete or partial 
human or macaque cDNA or with one or more sets of 
degenerate probes based on the disclosed sequences. A 

15 cDNA can also be cloned using the polymerase chain 
reaction, or PCR (Mullis, U.S. Patent No. 4,683,202), 
using primers designed from the sequences disclosed 
herein. Within an additional method, the cDNA library can 
be used to transform or transfect host cells, and 

20 expression of the cDNA of interest can be detected with an 
antibody to the receptor. Similar techniques can also be 
applied to the isolation of genomic clones. 

The polynucleotides of the present invention can 
25 be synthesized using DNA synthesizers. Currently the 
method of choice is the phosphoramidite method. If 
chemically synthesized double stranded DNA is required for 
an application such as the synthesis of a gene or a gene 
fragment, then each complementary strand is made 
30 separately. The production of short genes (60 to 80 bp) 
can be accomplished by synthesizing the complementary 
strands and then annealing them. For the production of 
longer genes (>300 bp) , however, special strategies must 
be invoked, because the coupling efficiency of each cycle 
35 during chemical DNA synthesis is seldom 100%. To overcome 
this problem, synthetic genes (double-stranded) are 
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assembled in modular form from single-stranded fragments 
that are from 20 to 100 nucleotides in length. 

One method for building a synthetic gene 
5 requires the initial production of a set of overlapping, 
complementary oligonucleotides, each of which is between 
20 to 60 nucleotides long. The sequences of the strands 
are planned so that, after annealing, the two end segments 
of the gene are aligned to give blunt or staggered ends. 

10 Each internal section of the gene has complementary 3 ' and 
5' terminal extensions that are designed to base pair 
precisely with an adjacent section. Thus, after the gene 
is assembled, the only remaining requirement to complete 
the process is sealing the nicks along the backbones of 

15 the two strands with T4 DNA ligase. In addition to the 
protein coding sequence, synthetic genes can be designed 
with terminal sequences that facilitate insertion into a 
restriction endonuclease sites of a cloning vector and 
other sequences should also be added that contain signals 

20 for the proper initiation and termination of transcription 
and translation. See Glick, Bernard R. and Jack J. 
Pasternak, Molecular Biotechnology, Principles & 
Applications of Recombinant DNA, (ASM Press, Washington, 
D.C. 1994), Itakura, K. et al. Synthesis and use of 

25 synthetic oligonucleotides. Annu. Rev. Biochem. 53 : 323- 
356 (1984), and Climie, S. et al. Chemical synthesis of 
the thymidylate synthase gene. Proc. Natl. Acad. Sci. USA 
87 :633-637 (1990) . 

30 Another embodiment of the present invention 

provides for a peptide or polypeptide comprising an 
epitope-bearing portion of a polypeptide of the invention. 
The epitope of the this polypeptide portion is an 
immunogenic or antigenic epitope of a polypeptide of the 

35 invention. A region of a protein to which an antibody can 
bind is defined as an "antigenic epitope". See for 
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instance, Geysen, H.M. et al., Proc. Natl. Acad Sci. USA 
51:3998-4002 (1984) . 

As to the selection of peptides or polypeptides 
5 bearing an antigenic epitope (i.e., that contain a region 
of a protein molecule to which an antibody can bind) , it 
is well known in the art that relatively short synthetic 
peptides that mimic part of a protein sequence are 
routinely capable of eliciting an antiserum that reacts 

10 with the partially mimicked protein. See Sutcliffe, J.G. 
et al. Science 219:660-666 (1983), Peptides capable of 
eliciting protein-reactive sera are frequently represented 
in the primary sequence of a protein, can be characterized 
by a set of simple chemical rules, and are confined 

15 neither to immunodominant regions of intact proteins 

(i.e., immunogenic epitopes) nor to the amino or carboxyl 
terminals. Peptides that are extremely hydrophobic and 
those of six or fewer residues generally are ineffective 
at inducing antibodies that bind to the mimicked protein; 

20 longer soluble peptides, especially those containing 
proline residues, usually are effective. 

Antigenic epitope-bearing peptides and 
polypeptides of the invention are therefore useful to 

25 raise antibodies, including monoclonal antibodies, that 
bind specifically to a polypeptide of the invention. 
Antigenic epitope-bearing peptides and polypeptides of the 
present invention contain a sequence of at least nine, 
preferably between 15 to about 30 amino acids contained 

30 within the amino acid sequence of a polypeptide of the 

invention. However, peptides or polypeptides comprising a 
larger portion of an amino acid sequence of the invention, 
containing from 30 to 50 amino acids, or any length up to 
and including the entire amino acid sequence of a 

35 polypeptide of the invention, also are useful for inducing 
antibodies that react with the protein. Preferably, the 
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amino acid sequence of the epitope-bearing peptide is 
selected to provide substantial solubility in aqueous 
solvents {i.e., the sequence includes relatively 
hydrophilic residues and hydrophobic residues are 
5 preferably avoided) ; and sequences containing proline 
residues are particularly preferred. All of the 
polypeptides shown in the sequence listing contain 
antigenic epitopes to be used according to the present 
invention, however, specifically designed antigenic 
10 epitopes include the peptides defined by SEQ ID NOs: 32-37. 

The present invention also provides isolated 
receptor polypeptides that are substantially identical to 

15 the receptor polypeptides of SEQ ID NOs: 2, 4 and 6 and 

their species orthologs. By "isolated" is meant a protein 
or polypeptide that is found in a condition other than its 
native environment, such as apart from blood and animal 
tissue. In a preferred form, the isolated polypeptide is 

20 substantially free of other polypeptides, particularly 
other polypeptides of animal origin. It is preferred to 
provide the polypeptides in a highly purified form, i.e. 
greater than 95% pure, more preferably greater than 99% 
pure. The term "substantially homologous" is used herein 

25 to denote polypeptides having 50%, preferably 60%, more 
preferably at least 80%, sequence identity to the 
sequences shown in SEQ ID NO: 2, 4, or 7 or their species 
orthologs. Such polypeptides will more preferably be at 
least 90% identical, and most preferably 95% or more 

30 identical to SEQ ID NO: 2, 4 or 6 or their species 

orthologs. Percent sequence identity is determined by 
conventional methods. See, for example, Altschul et al . , 
Bull. Math. Bio. 48: 603-616 (1986) and Henikoff and 
Henikoff, Proc. Natl. Acad. Sci. USA 85:10915-10919 

35 (1992) . Briefly, two amino acid sequences are aligned to 
optimize the alignment scores using a gap opening penalty 
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of 10, a gap extension penalty of 1, and the "blossom 62" 
scoring matrix of Henikoff and Henikoff (ibid.) as shown 
in Table 2 (amino acids are indicated by the standard one- 
letter codes) . The percent identity is then calculated 
5 as : 

Total number of identical matches 

x 100 

[length of the longer sequence plus the 
number of gaps introduced into the longer 
10 sequence in order to align the two 

sequences] 
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Sequence identity of polynucleotide molecules is 
determined by similar methods using a ratio as disclosed 
above . 



5 Polynucleotides, generally a cDNA sequence, of 

the present invention encode the above -described 
polypeptides. A cDNA sequence which encodes a polypeptide 
of the present invention is comprised of a series of 
codons, each amino acid residue of the polypeptide being 

10 encoded by a codon and each codon being comprised of three 
nucleotides. The amino acid residues are encoded by their 
respective codons as follows. 



Alanine (Ala) is encoded by GCA, GCC, GCG or 



15 GCT; 



Cysteine (Cys) is encoded by TGC or TGT; 
Aspartic acid (Asp) is encoded by GAC or GAT; 
Glutamic acid (Glu) is encoded by GAA or GAG; 
Phenylalanine (Phe) is encoded by TTC or TTT; 
20 Glycine (Gly) is encoded by GGA, GGC, GGG or 

GGT; 

Histidine (His) is encoded by CAC or CAT; 
Isoleucine (lie) is encoded by ATA, ATC or ATT; 
Lysine (Lys) is encoded by AAA, or AAG; 
25 Leucine (Leu) is encoded by.TTA, TTG, CTA, CTC, 

CTG or CTT; 

Methionine (Met) is encoded by ATG; 
Asparagine (Asn) is encoded by AAC or AAT; 
Proline (Pro) is encoded by CCA, CCC, CCG or 

30 CCT; 

Glut amine (Gin) is encoded by CAA or CAG; 
Arginine (Arg) is encoded by AGA, AGG, CGA, CGC, 
CGG or CGT; 

Serine (Ser) is encoded by AGC, AGT, TCA, TCC, 
35 TCG or TCT; 
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Threonine (Thr) is encoded by ACA, ACC, ACG or 

ACT; 

Valine (Val) is encoded by GTA, GTC, GTG or GTT; 
Tryptophan (Trp) is encoded by TGG; and 
5 Tyrosine (Tyr) is encoded by TAC or TAT. 

It is to be recognized that according to the 
present invention, when a cDNA is claimed as described 
above, it is understood that what is claimed are both the 

10 sense strand, the anti -sense strand, and the DNA as 
double -stranded having both the sense and anti -sense 
strand annealed together by their respective hydrogen 
bonds. Also claimed is the messenger RNA (mRNA) which 
encodes the polypeptides of the present invention, and 

15 which mRNA is encoded by the above -described cDNA. A 

messenger RNA (mRNA) will encode a polypeptide using the 
same codons as those defined above, with the exception 
that each thymine nucleotide (T) is replaced by a uracil 
nucleotide (U) . 

20 

Substantially identical proteins and 
polypeptides are characterized as having one or more amino 
acid substitutions, deletions or additions. These changes 

25 are preferably of a minor nature, that is conservative 
amino acid substitutions (see Table 3) and other 
substitutions that do not significantly affect the folding 
or activity of the protein or polypeptide; small 
deletions, typically of one to about 3 0 amino acids; and 

30 small amino- or carboxyl -terminal extensions, such as an 
amino -terminal methionine residue, a small linker peptide 
of up to about 20-25 residues, or a small extension that 
facilitates purification (an affinity tag) , such as a 
poly-histidine tract, protein A, Nilsson et al . , EMBO J. 

35 4:1075 (1985); Nilsson et al . , Methods Enzymol . 198:3 
(1991) , glutathione S transferase , Smith and Johnson, 
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Gene 67:31 (1988), or other antigenic epitope or binding 
domain. See, in general Ford et al . , Protein Expression 
and Purification 2: 95-107 (1991. DNAs encoding affinity 
tags are available from commercial suppliers (e.g., 
5 Pharmacia Biotech, Piscataway, NJ) . 



Table 3 



10 



15 



20 



25 



Conservative amino acid substitutions 



Basic : 



Acidic : 



Polar: 



Hydrophobic : 



Aromatic : 



Small : 



argmine 
lysine 
histidine 
glutamic acid 
aspartic acid 
glutamine 
asparagine 
leucine 
isoleucine 
valine 

phenylalanine 

tryptophan 

tyrosine 

glycine 

alanine 

serine 

threonine 

methionine 



Essential amino acids in the receptor 
polypeptides of the present invention can be identified 

30 according to procedures known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis, 
Cunningham and Wells, Science 244; 1081-1085 (1989); Bass 
et al., Proc. Natl. Acad. Sci. USA 88:4498-4502 (1991). 
In the latter technique, single alanine mutations are 

35 introduced at every residue in the molecule, and the 
resultant mutant molecules are tested for biological 
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activity (e.g., ligand binding and signal transduction) to 
identify amino acid residues that are critical to the 
activity of the molecule. Sites of ligand-receptor 
interaction can also be determined by analysis of crystal 
5 structure as determined by such techniques as nuclear 
magnetic resonance, crystallography or photoaf f inity 
labeling. See, for example, de Vos et al., Science 
255:306-312 (1992); Smith et al . , J. Mol . Biol. 224:899- 
904 (1992); Wlodaver et al . , FEBS Lett. 305:59-64 (1992). 
10 The identities of essential amino acids can also be 
inferred from analysis of homologies with related 
receptors . 

Multiple amino acid substitutions can be made 
15 and tested using known methods of mutagenesis and 

screening, such as those disclosed by Reidhaar-Olson and 
Sauer, Science 241:53-57 (1988) or Bowie and Sauer, Proc. 
Natl. Acad. Sci. USA £6:2152-2156 (1989). Briefly, these 
authors disclose methods for simultaneously randomizing 
20 two or more positions in a polypeptide, selecting for 
functional polypeptide, and then sequencing the 
mutagenized polypeptides to determine the spectrum of 
allowable substitutions at each position. Other methods 
that can be used include phage display, e.g., Lowman et 
25 al., Biochem. 30:10832-10837 (1991); Ladner et al . , U.S. 
Patent No. 5,223,409; Huse, WIPO Publication WO 92/06204) 
and region-directed mutagenesis, Derbyshire et al., Gene 
46:145 (1986); Ner et al., DNA 7:127, (1988). 

30 Mutagenesis methods as disclosed above can be 

combined with high- throughput screening methods to detect 
activity of cloned, mutagenized receptors in host cells. 
Preferred assays in this regard include cell proliferation 
assays and biosensor-based ligand-binding assays, which 

35 are described below. Mutagenized DNA molecules that encode 
active receptors or portions thereof (e.g., ligand-binding 
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fragments) can be recovered from the host cells and 
rapidly sequenced using modern equipment . These methods 
allow the rapid determination of the importance of 
individual amino acid residues in a polypeptide of 
5 interest and can be applied to polypeptides of unknown 
structure . 

Using the methods discussed above, one of 
ordinary skill in the art can prepare a variety of 
10 polypeptides that are substantially homologous to residues 
SEQ ID NOs:2, 4, 6, 17, 18, or 19 or allelic variants 
thereof and retain the ligand-binding properties of the 
wild- type receptor. 

15 The receptor polypeptides of the present 

invention, including full-length receptors, receptor 
fragments (e.g. ligand-binding fragments) , and fusion 
polypeptides can be produced in genetically engineered 
host cells according to conventional techniques. Suitable 

20 host cells are those cell types that can be transformed or 
transfected with exogenous DNA and grown in culture, and 
include bacteria, fungal cells, and cultured higher 
eukaryotic cells. Eukaryotic cells, particularly cultured 
cells of multicellular organisms, are preferred. 

25 Techniques for manipulating cloned DNA molecules and 

introducing exogenous DNA into a variety of host cells are 
disclosed by Sambrook et al Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989, and Ausubel et al., 

30 ibid. 

In general, a DNA sequence encoding a ZcytorS 
receptor polypeptide is operably linked to other genetic 
elements required for its expression, generally including 
35 a transcription promoter and terminator, within an 

expression vector. The vector will also commonly contain 
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one or more selectable markers and one or more origins of 
replication, although those skilled in the art will 
recognize that within certain systems selectable markers 
may be provided on separate vectors, and replication of 
5 the exogenous DNA may be provided by integration into the 
host cell genome. Selection of promoters, terminators, 
selectable markers, vectors and other elements is a matter 
of routine design within the level of ordinary skill in 
the art. Many such elements are described in the 
10 literature and are available through commercial suppliers. 

To direct a ZcytorS receptor polypeptide into 
the secretory pathway of a host cell, a secretory signal 
sequence (also known as a leader sequence, prepro sequence 

15 or pre sequence) is provided in the expression vector. 

The secretory signal sequence may be that of the receptor, 
or may be derived from another secreted protein (e.g., t- 
PA) or synthesized de novo. The secretory signal sequence 
is joined to the ZcytorS DNA sequence in the correct 

20 reading frame. Secretory signal sequences are commonly 
positioned 5' to the DNA sequence encoding the polypeptide 
of interest, although certain signal sequences may be 
positioned elsewhere in the DNA sequence of interest (see, 
e.g., Welch et al . , U.S. Patent No. 5,037,743; Holland et 

25 al., U.S. Patent No. 5,143,830). 

Cultured mammalian cells are preferred hosts 
within the present invention. Methods for introducing 
exogenous DNA into mammalian host cells include calcium 

30 phosphate-mediated transf ection, Wigler et al . , Cell 

14:725 (1978); Corsaro and Pearson, Somatic Cell Genetics 
7:603 (1981), Graham and Van der Eb, Virology 52:456 
(1973), electroporation, Neumann et al . , EMBO J. 1:841-845 
(1982) , DEAE-dextran mediated transf ection, Ausubel et 

35 al.., eds., Current Protocols in Molecular Biology (John 
Wiley and Sons, Inc., NY, 1987), and liposome-mediated 
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transfection, Hawley-Nelson et al., Focus 15:13 (1993); 
Ciccarone et al., Focus 15:80 (1993). The production of 
recombinant polypeptides in cultured mammalian cells is 
disclosed, for example, by Levinson et al . , U.S. Patent 
5 No. 4,713,339; Hagen et al . , U.S. Patent No. 4,784,950; 
Palmiter et al . , U.S. Patent No. 4,579,821; and Ringold, 
U.S. Patent No. 4,656,134. Suitable cultured mammalian 
cells include the COS-1 (ATCC No. CRL 1650), COS-7 (ATCC 
No. CRL 1651), BHK (ATCC No. CRL 1632), BHK 570 (ATCC No. 

10 CRL 10314), 293 (ATCC No. CRL 1573; Graham et al . , J. Gen. 
Virol. 3S: 59-72, (1977) and Chinese hamster ovary (e.g. 
CH0-K1; ATCC No. CCL 61) cell lines. Additional suitable 
cell lines are known in the art and available from public 
depositories such as the American Type Culture Collection, 

15 Rockville, Maryland. In general, strong transcription 
promoters are preferred, such as promoters from SV-40 or 
cytomegalovirus. See, e.g., U.S. Patent No. 4,956,288. 
Other suitable promoters include those from 
metallothionein genes (U.S. Patent Nos . 4,579,821 and 

20 4,601,978,) and the adenovirus major late promoter. 

Drug selection is generally used to select for 
cultured mammalian cells into which foreign DNA has been 
inserted. Such cells are commonly referred to as 

25 "transfectants". Cells that have been cultured in the 
presence of the selective agent and are able to pass the 
gene of interest to their progeny are referred to as 
"stable transfectants." A preferred selectable marker is 
a gene encoding resistance to the antibiotic neomycin. 

30 Selection is carried out in the presence of a neomycin- 
type drug, such as G-418 or the like. Selection systems 
may also be used to increase the expression level of the 
gene of interest, a process referred to as 
"amplification." Amplification is carried out by 

35 culturing transfectants in the presence of a low level of 
the selective agent and then increasing the amount of 
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selective agent to select for cells that produce high 
levels of the products of the introduced genes. A 
preferred amplifiable selectable marker is dihydrof olate 
reductase, which confers resistance to methotrexate. 
5 Other drug resistance genes (e.g. hygromycin resistance, 
multi-drug resistance, puromycin acetyltransf erase) can 
also be used. 

Other higher eukaryotic cells can also be used 
10 as hosts, including insect cells, plant cells and avian 
cells. Transformation of insect cells and production of 
foreign polypeptides therein is disclosed by Guarino et 
al., U.S. Patent No. 5,162,222; Bang et al., U.S. Patent 
No. 4,775,624; and WIPO publication WO 94/06463, which are 
15 incorporated herein by reference. The use of 

Agrobacterium rhizogenes as a vector for expressing genes 
in plant cells has been reviewed by Sinkar et al., J". 
Biosci. (Bangalore) 21:47-58 (1987). 

20 Fungal cells, including yeast cells, and 

particularly cells of the genus Saccharomyces , can also be 
used within the present invention, such as for producing 
receptor fragments or polypeptide fusions. Methods for 
transforming yeast cells with exogenous DNA and producing 

25 recombinant polypeptides therefrom are disclosed by, for 
example, Kawasaki, U.S. Patent No. 4,599,311; Kawasaki et 
al., U.S. Patent No. 4,931,373; Brake, U.S. Patent No. 
4,870,008; Welch et al . , U.S. Patent No. 5,037,743; and 
Murray et al . , U.S. Patent No. 4,845. Transformed cells 

30 are selected by phenotype determined by the selectable 

marker, commonly drug resistance or the ability to grow in 
the absence of a particular nutrient (e.g., leucine) . A 
preferred vector system for use in yeast is the POT1 
vector system disclosed by Kawasaki et al. (U.S. Patent 

35 No. 4,931,373), which allows transformed cells to be 

selected by growth in glucose-containing media. Suitable 



WO 98/49307 



PCT/US98/08865 



31 



promoters and terminators for use in yeast include those 

from glycolytic enzyme genes {see, e.g., Kawasaki, U.S. 

Patent No. 4,599,311; Kingsman et al., U.S. Patent No. 

4,615,974; and Bitter, U.S. Patent No. 4,977,092) and 
5 alcohol dehydrogenase genes. See also U.S. Patents Nos. 

4,990,446; 5,063,154; 5,139,936 and 4,661,454. 

Transformation systems for other yeasts, including 

Hansenula polymorpha, Schizosaccharomyces pombe, 

Kluyveromyces lactis, Kluyveromyces fragilis, Ustilago 
10 maydis, Pichia pastoris, Pichia methanolica, Pichia 

guillermondii and Candida maltosa are known in the art. 

See, for example, Gleeson et al . , J". Gen. Microbiol. 

132:3459-3465 (1986) and Cregg, U.S. Patent No. 4,882,279. 

Aspergillus cells may be utilized according to the methods 
15 of McKnight et al., U.S. Patent No. 4,935,349. Methods 

for transforming Acremonium chrysogenum are disclosed by 

Sumino et al . , U.S. Patent No. 5,162,228. Methods for 

transforming Neurospora are disclosed by Lambowitz, U.S. 

Patent No. 4,486,533. 

20 

Transformed or transfected host cells are 
cultured according to conventional procedures in a culture 
medium containing nutrients and other components required 
for the growth of the chosen host cells. A variety of 

25 suitable media, including defined media and complex media, 
are known in the art and generally include a carbon 
source, a nitrogen source, essential amino acids, vitamins 
and minerals. Media may also contain such components as 
growth factors or serum, as required. The growth medium 

30 will generally select for cells containing the exogenously 
added DNA by, for example, drug selection or deficiency in 
an essential nutrient which is complemented by the 
selectable marker carried on the expression vector or co- 
transfected into the host cell. 



35 
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Within one aspect of the present invention, a 
novel receptor is produced by a cultured cell, and the 
cell is used to screen for ligands for the receptor, 
including the natural ligand, as well as agonists and 
5 antagonists of the natural ligand. To summarize this 

approach, a cDNA or gene encoding the receptor is combined 
with other genetic elements required for its expression 
(e.g., a transcription promoter), and the resulting 
expression vector is inserted into a host cell. Cells 
10 that express the DNA and produce functional receptor are 
selected and used within a variety of screening systems. 

Mammalian cells suitable for use in expressing 
ZcytorS receptors and transducing a receptor-mediated 

15 signal include cells that express a p- subunit, such as the 
human p c subunit. In this regard it is generally 
preferred to employ a cell that is responsive to other 
cytokines that bind to receptors in the same subfamily, 
such as IL-3 or GM-CSF, because such cells will contain 

20 the requisite signal transduction pathway (s) . It is also 
preferred to use a cell from the same species as the 
receptor to be expressed. Within a preferred embodiment, * 
the cell is dependent upon an exogenously supplied 
hematopoietic growth factor for its proliferation. 

25 Preferred cell lines of this type are the human TF-1 cell 
line (ATCC number CRL-2003) and the AML-193 cell line 
(ATCC number CRL-9589) , which are GM-CSF -dependent human 
leukemic cell lines. In the alternative, suitable host 
cells can be engineered to produce a P-subunit (e.g., b c ) 

30 or other cellular component needed for the desired 
cellular response. For example, the murine cell line 
BaF3, Palacios and Steinmetz, Cell 41: 727-734 (1985); 
Mathey-Prevot et al., Mol. Cell. Biol. 6: 4133-4135 (1986) 
or a baby hamster kidney (BHK) cell line can be 

35 transfected to express the human b c subunit (also known as 
KH97) as well as a ZcytorS receptor. The latter approach 
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is advantageous because cell lines can be engineered to 
express receptor subunits from any species, thereby 
overcoming potential limitations arising from species 
specificity. In the alternative, species orthologs of the 
5 human receptor cDNA can be cloned and used within cell 
lines from the same species, such as a mouse cDNA in the 
BaF3 cell line. Cell lines that are dependent upon one 
hematopoietic growth factor, such as GM-CSF, can thus be 
engineered to become dependent upon a Zcytor5 ligand. 

10 

Cells expressing functional receptor are used 
within screening assays. A variety of suitable assays are 
known in the art. These assays are based on the detection 
of a biological response in a target cell. One such assay 

15 is a cell proliferation assay. Cells are cultured in the 
presence or absence of a test compound, and cell 
proliferation is detected by, for example, measuring 
incorporation of tritiated thymidine or by colorimetric 
assay based on the metabolic breakdown of 3- (4, 5- 

20 dimethylthiazol-2-yl) -2, 5-diphenyl tetrazolium bromide 
(MTT) (Mosman, J. Immunol. Meth. 65: 55-63 (1983). An 
alternative assay format uses cells that are further 
engineered to express a reporter gene. The reporter gene 
is linked to a promoter element that is responsive to the 

25 receptor-linked pathway, and the assay detects activation 
of transcription of the reporter gene. A preferred 
promoter element in this regard is a serum response 
element, or SRE, e.g., Shaw et al . , Cell 56:563-572 
(1989) . A preferred reporter gene is a lucif erase gene, 

30 de Wet et al., Mol. Cell. Biol. 7:725 (1987). Expression 
of the luciferase gene is detected by luminescence using 
methods known in the art (e.g., Baumgartner et al., J. 
Biol. Chem. 265:29094-29101 (1994); Schenborn and Goiffin, 
Promega Notes 41:11 (1993). Luciferase activity assay 

35 kits are commercially available from, for example, Promega 
Corp., Madison, WI . Target cell lines of this type can be 
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used to screen libraries of chemicals, cell -conditioned 
culture media, fungal broths, soil samples, water samples, 
and the like. For example, a bank of cell-conditioned 
media samples can be assayed on a target cell to identify 
5 cells that produce ligand. Positive cells are then used 
to produce a cDNA library in a mammalian expression 
vector, which is divided into pools, transfected into host 
cells, and expressed. Media samples from the transfected 
cells are then assayed, with subsequent division of pools, 
10 re-transf ection, subculturing, and re -assay of positive 
cells to isolate a cloned cDNA encoding the ligand. 

A natural ligand for the Zcytor5 receptor can 
also be identified by mutagenizing a cell line expressing 

15 the receptor and culturing it under conditions that select 
for autocrine growth. See WIPO publication WO 95/21930. 
Within a typical procedure, BaF3 cells expressing ZcytorB 
and human b c are mutagenized, such as with 2- 
ethylmethanesulfonate (EMS) . The cells are then allowed 

20 to recover in the presence of 1L-3, then transferred to a 
culture medium lacking IL-3 and IL-4. Surviving cells are 
screened for the production of a ZcytorB ligand, such as 
by adding soluble receptor to the culture medium or by 
assaying conditioned media on wild-type BaF3 cells and 

25 BaF3 cells expressing the receptor. 

An additional screening approach provided by the 
present invention includes the use of hybrid receptor 
polypeptides. These hybrid polypeptides fall into two 

30 general classes. It is preferred that the second receptor 
be a hematopoietic cytokine receptor, such as mpl receptor 
(Souyri et al., Cell 63: 1137-1147 (1990). The hybrid 
receptor will further comprise a transmembrane domain, 
which may be derived from either receptor. A DNA 

35 construct encoding the hybrid receptor is then inserted 
into a host cell. Cells expressing the hybrid receptor 
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are cultured in the presence of a ligand for the binding 
domain and assayed for a response. This system provides a 
means for analyzing signal transduction mediated by 
ZcytorS while using readily available ligands. This 
5 system can also be used to determine if particular cell 
lines are capable of responding to signals transduced by 
ZcytorS . 

Cells found to express the ligand are then used 
10 to prepare a cDNA library from which the ligand- encoding 
cDNA can be isolated as disclosed above. The present 
invention thus provides, in addition to novel receptor 
polypeptides, methods for cloning polypeptide ligands for 
the receptors. 

15 

Compounds identified as receptor agonists are 
useful for stimulating proliferation and development of 
target cells in vitro and in vivo. For example, agonist 
compounds are useful as components of defined cell culture 

20 media, and may be used alone or in combination with other 
cytokines and hormones to replace serum that is commonly 
used in cell culture. Agonists are thus useful in 
specifically promoting the growth and/or development of 
testis-derived cells in culture. Agonists and antagonists 

25 may also prove useful in the study of spermatogenesis and 
infertility. Antagonists are useful as research reagents 
for characterizing sites of ligand-receptor interaction. 
In vivo, receptor agonists may find application in the 
treatment of male infertility. Antagonists of receptor 

30 function may be useful as male contraceptive agents. 

The proposed cytokine binding domain of ZcytorS 
appears to be closest to the Interleukin-6 p-chain or 
gpl30 (29% identity) . The ligand for ZcytorS is probably 
35 a member of the Interleukin-6 family of cytokines which at 
present includes: Interleukins-6, -11, Leukemia Inhibitory 
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Factor, Oncostatin M, Cardiotropic 1 and Ciliary 
Neurtrophic Factor. 

All ZcytorS cDNAs isolated thus far do not 
5 encode a transmembrane domain nor any recognizable 

cytoplasmic signaling motifs characteristic of the Class I 
receptors. Structurally, ZcytorS bears close similarity to 
a-subunit of the Ciliary Neutrophic Factor receptor ( CNTF - 
Rcc) . It is quite possible ZcytorS does not have a 
10 transmembrane domain form and that the native molecule is 
phosphatidyl -inositol linked to the cell membrane in a 
manner similar to CNTF-Roc . 

Rebledo et al . (J. Biol. Chem. , 272: 4855-4863) 

15 provide evidence for the existence of a third component of 
the Cardiotropin-l receptor (CT-1R) . CT-1R is believed to 
have a tripartite structure comprised of gpl30, gpl90 (LIV 
Receptor 0) and an uncharacterized 45kDa (CT-lRa) subunit 
that appears to be linked to the cell surface through a 

20 phosphatidyl -inositol linkage. CT-lRa appears to be 

important for increased sensitivity and specificity of the 
receptor complex to Cardiotropin-l. The data suggests that 
ZcytorS is CT-lRa. Cardiotropin-l is a member of the 
Interleukin-6 family in which gpl3 0 and gpl90 are members 

25 of a tripartite complex is the Ciliary Neurotrophic Factor 
receptor. In this receptor complex, CNTF-Ra comprises the 
third receptor subunit and it mediates specificity and 
high affinity binding of the ligand complex. These 
functions are similar to the proposed ones for CT-lRa. One 

30 might then argue on the basis of "symmetry of nature" that 
CT-lRa would physically resemble CNTF-Ra and that the 
close structural similarity of ZcytorS to CNTF-Ra would 
make ZcytorB a possible candidate for the third subunit of 
CT-1R. Furthermore, the proposed 45 kDa molecular mass of 

35 CT-Ra agrees with that of Zcytor5 and the transcripts of 
CT-1 and ZcytorS are found in similar tissues. In 
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particular, both transcripts are found in high levels in 
heart and in skeletal muscles, which is consistent with 
the observation that ligand and their receptor subunits 
are often co-expressed in the same tissue. 

5 

Cardiotropic 1 was originally cloned by function 
as a factor involved in cardiac hypertrophy, an adaptive 
response of heart muscle to an increased work load. 
Hypertrophy is characterized by reactivation of genes 

10 expressed during fetal heart development and by the 

accumulation of carsomeric proteins. If ZcytorB proves to 
be the subunit that is important in the binding and 
specificity of Cardiotropic 1 to its receptor, ZcytorB may 
prove to be a useful therapeutic antagonist to counteract 

15 the hypertrophic response to injury. Cardiotropic 1 has 
also been shown to promote survival of rat dopaminergic 
neurons in vitro. An agonist -active soluble receptor may 
potentially be useful in the treatment of neuronal 
disorders such as Parkinson's disease. 

20 

ZcytorB may also be used within diagnostic 
systems for the detection of circulating levels of ligand. 
Within a related embodiment, antibodies or other agents 
that specifically bind to ZcytorB can be used to detect 
25 circulating receptor polypeptides. Elevated or depressed 
levels of ligand or receptor polypeptides may be 
indicative of pathological conditions, including cancer. 

ZcytorB receptor polypeptides can be prepared by 
30 expressing a DNA encoding a ZcytorB polypeptide as 

described in SEQ ID N0:1, 3 and S. To direct the export of 
the receptor domain from the host cell, the receptor DNA 
is linked to a second DNA segment encoding a secretory 
peptide. It is believed that amino acid 1-34 or in the 
35 alternative amino acid residues 1-30 are secretory 

peptides of SEQ ID NO:2. For SEQ ID NO: 4, it is believed 
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secretory peptides. For the rat sequence, it is believed 
that amino acid residues 1-40 define a secretory peptide. 
These peptides are generally cleaved after secretion by a 
5 mammalian cell. In the alternative, other secretory 

peptides could be fused to the ZcytorB polypeptide, such 
as the t-PA secretory peptide. To facilitate purification 
of the secreted receptor domain, a C-terminal extension, 
such as a poly-histidine tag, substance P, Flag™ peptide 
10 (Hopp et al., Biotechnology 6:1204-1210 (1988); available 
from Eastman Kodak Co., New Haven, CT) or another 
polypeptide or protein for which an antibody or other 
specific binding agent is available, can be fused to the 
receptor polypeptide. 

15 

In an alternative approach, a receptor 
extracellular domain can be expressed as a fusion with 
immunoglobulin heavy chain constant regions, typically an 
F c fragment, which contains two constant region domains 

20 and a hinge region but lacks the variable region. Such 
fusions are typically secreted as multimeric molecules 
wherein the Fc portions are disulfide bonded to each other 
and two receptor polypeptides are arrayed in closed 
proximity to each other. Fusions of this type can be used 

25 to affinity purify the cognate ligand from solution, as an 
in vitro assay tool, to block signals in vitro by 
specifically titrating out ligand, and as antagonists in 
vivo by administering them parenterally to bind 
circulating ligand and clear it from the circulation. To 

30 purify ligand, a 2cytor5-Ig chimera is added to a sample 
containing the ligand (e.g., cell-conditioned culture 
media or tissue extracts) under conditions that facilitate 
receptor- ligand binding (typically near-physiological 
temperature, pH, and ionic strength) . The chimera- ligand 

35 complex is then separated by the mixture using protein A, 
which is immobilized on a solid support (e.g., insoluble 
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resin beads) . The ligand is then eluted using 
conventional chemical techniques, such as with a salt or 
pH gradient. In the alternative, the chimera itself can 
be bound to a solid support, with binding and elution 
5 carried out as above. The chimeras may be used in vivo to 
induce infertility. Chimeras with high binding affinity 
are administered parenterally (e.g., by intramuscular, 
subcutaneous or intravenous injection) . Circulating 
molecules bind ligand and are cleared from circulation by 
10 normal physiological processes. For use in assays, the 
chimeras are bound to a support via the F c region and used 
in an EL ISA format . 

A preferred assay system employing a ligand- 

15 binding receptor fragment uses a commercially available 
biosensor instrument (BIAcore™, Pharmacia Biosensor, 
Piscataway, NJ) , wherein the receptor fragment is 
immobilized onto the surface of a receptor chip. Use of 
this instrument is disclosed by Karlsson, J. Immunol . 

20 Methods 145:229-240 (1991) and Cunningham and Wells, J. 
Mol. Biol. 234:554-563 (1993), A receptor fragment is 
covalently attached, using amine or sulfhydryl chemistry, 
to dextran fibers that are attached to gold film within 
the flow cell. A test sample is passed through the cell. 

25 If ligand is present in the sample, it will bind to the 
immobilized receptor polypeptide, causing a change in the 
refractive index of the medium, which is detected as a 
change in surface plasmon resonance of the gold film. 
This system allows the determination of on- and off -rates, 

30 from which binding affinity can be calculated, and 
assessment of stoichiometry of binding. 

Ligand-binding receptor polypeptides can also be 
used within other assay systems known in the art. Such 
35 systems include Scatchard analysis for determination of 
binding affinity, Scatchard, Ann. NY Acad. Sci. 51: 660- 
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672 (1949) and calorimetric assays, Cunningham et al . , 
Science 253:545-548 (1991); Cunningham et al . , Science 
254:821-825 (1991) . 

5 A receptor ligand-binding polypeptide can also 

be used for purification of ligand. The receptor 
polypeptide is immobilized on a solid support, such as 
beads of agarose, cross- linked agarose, glass, cellulosic 
resins, silica-based resins, polystyrene, cross-linked 
10 polyacrylamide, or like materials that are stable under 
the conditions of use. Methods for linking polypeptides 
to solid supports are known in the art, and include amine 
chemistry, cyanogen bromide activation, N- 
hydroxysuccinimide activation, epoxide activation, 
15 sulfhydryl activation, and hydrazide activation. The 

resulting media will generally be configured in the form 
of a column, and fluids containing ligand are passed 
through the column one or more times to allow ligand to 
bind to the receptor polypeptide. The ligand is then 
20 eluted using changes in salt concentration or pH to 
disrupt ligand- receptor binding. 

ZcytorS polypeptides can also be used to prepare 
antibodies that specifically bind to ZcytorS polypeptides. 
As used herein, the term "antibodies" includes polyclonal 
antibodies, monoclonal antibodies, antigen -binding 
fragments thereof such as F(ab») 2 and Fab fragments, and 
the like, including genetically engineered antibodies. 
Antibodies are defined to be specifically binding if they 
bind to a. ZcytorS polypeptide with a K a of greater than or 
equal to 10 7 /M. The affinity of a monoclonal antibody can 
be readily determined by one of ordinary skill in the art 
(see, for example, Scatchard, ibid.). 

ZcytorS polypeptides can also be used to prepare 
antibodies that specifically bind to ZcytorS polypeptides. 
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These antibodies can then be used to manufacture anti- 
idiotype antibodies. As used herein, the term 
"antibodies" includes polyclonal antibodies, monoclonal 
antibodies, antigen-binding fragments thereof such as 
5 F(ab')2 and Fab fragments, and the like, including 

genetically engineered antibodies. Antibodies are defined 
to be specifically binding if they bind to a ZcytorS 
polypeptide with a K a of greater than or equal to 10 7 /M. 
The affinity of a monoclonal antibody can be readily 
10 determined by one of ordinary skill in the art (see, for 
example, Scatchard, ibid.). 

Methods for preparing polyclonal and monoclonal 
antibodies are well known in the art (see for example, 

15 Sambrook et al., Molecular Cloning: A Laboratory Manual, 
(Second Edition) (Cold Spring Harbor, NY, 1989); and 
Hurrell, J. G. R., Ed., Monoclonal Hybridoma Antibodies : 
Techniques and Applications (CRC Press, Inc., Boca Raton, 
FL, 1982) . As would be evident to one of ordinary skill 

20 in the art, polyclonal antibodies can be generated from a 
variety of warm-blooded animals such as horses, cows, 
goats, sheep, dogs, chickens, rabbits, mice, and rats. 
The immunogenic ity of a ZcytorS polypeptide may be 
increased through the use of an adjuvant such as Freund's 

25 complete or incomplete adjuvant. A variety of assays 
known to those skilled in the art can be utilized to 
detect antibodies which specifically bind to ZcytorS 
polypeptides. Exemplary assays are described in detail in 
Antibodies: A Laboratory Manual, Harlow and Lane (Eds.), 

30 (Cold Spring Harbor Laboratory Press, 1988) . 

Representative examples of such assays include: concurrent 
immunoelectrophoresis, radio -immunoassays, radio- 
immunoprecipi tat ions, enzyme -linked immunosorbent assays 
(ELISA) , dot blot assays, inhibition or competition 

35 assays, and sandwich assays. 
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As would be evident to one of ordinary skill in 
the art, polyclonal antibodies can be generated by 
inoculating a variety of warm-blooded animals such as 
horses, cows, goats, sheep, dogs, chickens, rabbits, mice, 
5 hamsters, guinea pigs and rats with a ZcytorS polypeptide 
or a fragment thereof. The immunogenicity of a ZcytorS 
polypeptide may be increased through the use of an 
adjuvant, such as alum (aluminum hydroxide) or Freund's 
complete or incomplete adjuvant. Polypeptides useful for 

10 immunization also include fusion polypeptides, such as 
fusions of ZcytorS or a portion thereof with an 
immunoglobulin polypeptide or with maltose binding 
protein. The polypeptide immunogen may be a full-length 
molecule or a portion thereof. If the polypeptide portion 

15 is "hapten-like", such portion may be advantageously 
joined or linked to a macromolecular carrier (such as 
keyhole limpet hemocyanin (KLH) , bovine serum albumin 
(BSA) or tetanus toxoid) for immunization. 

20 As used herein, the term "antibodies" includes 

polyclonal antibodies, affinity-purified polyclonal 
antibodies, monoclonal antibodies, and antigen-binding 
fragments, such as F(ab , )2 and Fab proteolytic fragments. 
Genetically engineered intact antibodies or fragments, 

25 such as chimeric antibodies, Fv fragments, single chain 
antibodies and the like, as well as synthetic antigen- 
binding peptides and polypeptides, are also included. 
Non-human antibodies may be humanized by grafting non- 
human CDRs onto human framework and constant regions, or 

30 by incorporating the entire non-human variable domains 
(optionally "cloaking" them with a human-like surface by 
replacement of exposed residues, wherein the result is a 
"veneered" antibody) . In some instances, humanized 
antibodies may retain non-human residues within the human 

35 variable region framework domains to enhance proper 
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binding characteristics. Through humanizing antibodies, 
biological half-life may be increased, and the potential- 
for adverse immune reactions upon administration to humans 
is reduced. 

5 

Alternative techniques for generating or 
selecting antibodies useful herein include in vitro 
exposure of lymphocytes to ZcytorS protein or peptide, and 
selection of antibody display libraries in phage or 

10 similar vectors (for instance, through use of immobilized 
or labeled ZcytorS protein or peptide) . Genes encoding 
polypeptides having potential ZcytorS polypeptide binding 
domains can be obtained by screening random peptide 
libraries displayed on phage (phage display) or on 

15 bacteria, such as E. coll. Nucleotide sequences encoding 
the polypeptides can be obtained in a number of ways, such 
as through random mutagenesis and random polynucleotide 
synthesis. These random peptide display libraries can be 
used to screen for peptides which interact with a known 

20 target which can be a protein or polypeptide, such as a 
ligand or receptor, a biological or synthetic 
macromolecule, or organic or inorganic substances. 
Techniques for creating and screening such random peptide 
display libraries are known in the art (Ladner et al., US 

25 Patent NO. 5,223,409; Ladner et al . , US Patent NO. 

4,946,778; Ladner et al., US Patent NO. 5,403,484 and 
Ladner et al., US Patent NO. 5,571,698) and random peptide 
display libraries and kits for screening such libraries 
are available commercially, for instance from Clontech 

30 (Palo Alto, CA) , Invitrogen Inc. (San Diego, CA) , New 
England Biolabs, Inc. (Beverly, MA) and Pharmacia LKB 
Biotechnology Inc. (Piscataway, NJ) . Random peptide 
display libraries can be screened using the ZcytorS 
sequences disclosed herein to identify proteins which bind 

35 to ZcytorS. These "binding proteins" which interact with 
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ZcytorS polypeptides can be used for tagging cells; for 
isolating homolog polypeptides by affinity purification; 
they can be directly or indirectly conjugated to drugs, 
toxins, radionuclides and the like. These binding 
5 proteins can also be used in analytical methods such as 
for screening expression libraries and neutralizing 
activity. The binding proteins can also be used for 
diagnostic assays for determining circulating levels of 
polypeptides; for detecting or quantitating soluble 
10 polypeptides as marker of underlying pathology or disease. 
These binding proteins can also act as ZcytorS 
"antagonists" to block ZcytorB binding and signal 
transduction in vitro and in vivo, 

15 Antibodies can also be generated gene therapy. 

The animal is administered the DNA or RNA which encodes 
ZcytorB or an immunogenic fragment thereof so that cells 
of the animals are transfected with the nucleic acid and 
express the protein which in turn elicits an immunogenic 

20 response. Antibodies which then are produced by the animal 
are isolated in the form of polyclonal or monoclonal 
antibodies. 



25 Antibodies to ZcytorS may be used for tagging 

cells that express the protein, for affinity purification, 
within diagnostic assays for determining circulating 
levels of soluble protein polypeptides, and as antagonists 
to block ligand binding and signal transduction in vitro 

30 and in vivo. 



Methods for preparing polyclonal and monoclonal 
antibodies are well known in the art (see for example, 
35 Sambrook et al . , Molecular Cloning: A Laboratory Manual, 
Second Edition, Cold Spring Harbor, NY (1989) ; and 
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Hurrell, J. G. R., Ed., Monoclonal Hybridoma Antibodies: 
Techniques and Applications (CRC Press, Inc., Boca Raton, 
FL, 1982) . As would be evident to one of ordinary skill 
in the art, polyclonal antibodies can be generated from a 
5 variety of warm-blooded animals such as horses, cows, 
goats, sheep, dogs, chickens, rabbits, mice, and rats. 
Generally speaking, antibodies against which bind to the 
claimed ZcytorS polypeptides can be raised by immunization 
of animals with a ZcytorS polypeptide or a fragment 

10 thereof. The immunogenicity of a ZcytorS polypeptide may 
be increased through the use of an adjuvant such as 
Freund's complete or incomplete adjuvant. A variety of 
assays known to those skilled in the art can be utilized 
to detect antibodies which specifically bind to ZcytorS 

15 polypeptides. Exemplary assays are described in detail in 
Antibodies: A Laboratory Manual, Harlow and Lane (Eds.) 
(Cold Spring Harbor Laboratory Press, 1988). 
Representative examples of such assays include: concurrent 
Immunoelectrophoresis, radio- immunoassays, radio- 

20 immunoprecipitations, enzyme -linked immunosorbent assays 
(ELISA) , dot blot assays, inhibition or competition 
assays, and sandwich assays. 

Antibodies to ZcytorS are may be used for 
25 tagging cells that express the receptor, for affinity 
purification, within diagnostic assays for determining 
circulating levels of soluble receptor polypeptides, and 
as antagonists to block ligand binding and signal 
transduction in vitro and in vivo. 



30 
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Uses 

The tissue specificity of ZcytorS expression 
suggests that ZcytorS may be a receptor for growth and/or 
5 maintenance factor in the thyroid heart and skeletal 
muscle. ZcytorS could therefor be used to down regulate 
the effects of the factor by administering soluble ZcytorS 
to the patient. For example the soluble receptor could be 
used to lessen the effect of cardiotrophin-1 on cardiac 
10 pathologies. Thus preventing enlargement of the heart due 
to heart disease. ZcytorS could also be used as a 
diagnostic to test for the presence of cardiotrophin-1 in 
the blood. Furthermore, ZcytorS can be used to discover 
other possible ligands which would bind to ZcytorS. 

15 

The present invention also provides reagents 
which will find use in diagnostic applications. For 
example, the ZcytorS gene. A probe comprising the ZcytorS 
DNA or RNA or a subsequence thereof can be used to 
20 determine if the ZcytorS gene is present on chromosome 1 
or if a mutation has occurred. 

Antibodies to ZcytorS could be used to purify 
ZcytorS and as a therapeutic to modulate the effect of the 
25 ZcytorS ligand. The anti -idiotypic antibody to ZcytorS 
could be used to purify the ligand of ZcytorS and the 
administration of the anti -idiotypic antibody could be 
used to modulate the effect of the ZcytorS ligand. 

30 The invention is further illustrated by the following non- 
limiting examples. 
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Example 1 

Cloning of Human 2cytor5 

Human ZcytorS was identified from expressed 
5 sequence tag (EST) 698365 (SEQ ID NO: 7) identified in an 
EST database. The cDNA containing EST 698365 was obtained 
from Incyte Pharmaceuticals, Inc. as dried DNA. Upon 
reconstitution in water, the cDNA was transfected into £\ 
coli strain DH10B. The plasmid was designated pSL83 65. 
10 The EST in plasmid pSL8365 was sequenced, revealing an 
insert of 952 bp. 

The GENE TRAPPER® cDNA positive selection system 
(Life Technologies, Gaithersburg, MD) employing 

15 oligonucleotide ZC11,286 (SEQ ID NO: 8) was used to 
isolate the plasmid Hzcytor5-9 from a human lung cDNA 
library (obtained from Life Technologies Inc., 
Gaithersburg, MD) in accordance with the manufacturer's 
directions. Hzcytor5-9 extended the sequence of pSL8365 by 

20 459 bp. The sequence present in Hzcytor5-9 allowed the 
isolation of an overlapping EST No. 4 85212 (SEQ ID NO: 9) , 
which extended the open reading frame of Hzcytor5-9 by a 
further 33 codons . 

25 A cDNA encoding full-length ZcytorS was isolated 

from a human testis cDNA library. (See Example 2 for the 
preparation of the human cDNA testis library.) The library 
was comprised of eighty pools of plasmid DNA, each pool 
comprised of 10,000 independent recombinants. The presence 

30 of ZcytorS cDNA in each library pool was determined by PCR 
employing primers ZC11,663 (SEQ ID NO: 10) and ZC12,212 
(SEQ ID NO: 11) . PCR was carried out using AMPLITAQ® DNA 
polymerase (Perkin-Elmer) in buffer conditions recommended 
by the supplier. The amplification was carried out at 94° C 

35 for 1 minute followed by 30 cycles, each cycle consisting 
of 20 seconds at 94° C, 1 minute at 66° C and 7 minutes at 
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74° C. Five cDNA pools were found to be positive for the 
420 bp PCR product by agarose gel electrophoresis. 

Plasmid DNA from one positive library pool was 
5 electrophoresed into DH10B cells and plated. Colony lifts 
were prepared using Hybond-N filters (Amersham; Arlington 
Heights, IL) according to the procedure provided by the 
manufacturer. Following denaturation and neutralization, 
DNA was cross-linked onto the filters with 1,200 jiJoules 

10 of UV energy in a STRATAL INKER® (Stratagene Cloning 
Systems) . Cell debris was removed by several washes in 
0.25X standard sodium citrate (SSC) , 0.25% sodium dodecyl 
sulfate (SDS) and 1 mM EDTA at 65° C. The filters were then 
pre-hybridized overnight at 65° C in EXPRES SHYB ® solution 

15 (Clontech) with 1 mg/ml heat denatured salmon sperm DNA. 
Colonies positive for ZcytorS were identified by 
hybridization with a probe that was generated from EST 
484212 (SEQ ID NO: 9) cDNA employing PCR primers ZC11,663 
(SEQ ID NO: 10) and ZC12,212 (SEQ ID NO: 11) . The PCR 

20 product probe was purified by agarose gel electrophoresis. 
100 ng of the probe was labeled with 32 P dCTP using the 
MULTI-PRIME® DNA labeling system (Amersham) . 
Unincorporated label was removed with a NUCTRAP® column 
(Stratagene) . Probe hybridization was carried out 

25 overnight at 65° C in EXPRESSHYB® solution at a probe 

concentration of 1 xlO 6 cpm/ml. The filters were washed at 
65° C in a wash buffer containing 0.25X SSC, 0.25 SDS and 1 
mM EDTA. 

30 Three positive signals were identified and were 

subjected to colony purification via a second round of 
filter hybridization. Sequence analysis of one positive 
clone, SEQ ID NO: 3 was found to be full length human 
Zcytor-5. Sequencing of a several overlapping clones 

35 revealed a second full-length sequence SEQ ID NO: 1 which 
is an allelic variant of SEQ ID NO: 3. 
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Example 2 

Construction of the Human Testis cDNA Library 

5 Fourteen ^1 of poly d(T) - selected poly (A) + 

human testis mRNA (Clontech) at a concentration of 1.0 
|ag/nl was mixed with 2 [il of 20 pmole/^il first strand 
primer 2C2938 (SEQ ID NO: 12 ) containing an Sst I 
restriction site. The mixture was heated at 65° C for 4 

10 minutes and cooled by chilling on ice. First strand cDNA 
synthesis was initiated by the addition of 8 |il of 250 mM 
Tris-HCl, pH 8.3, 375 mM KC1, 15 mM MgCl 2 (5x SUPERSCRIPT™ 
buffer; GIBCO BRL) , 4 jil of 100 ™M dithiothreitol (DTT) 
and 2 [il of a deoxynucleotide triphosphate solution 

15 containing 10 mM each of dATP, dGTP, dTTP and 5 -methyl - 
dCTP (Pharmacia LKB Biotechnology Inc.) to the RNA-primer 
mixture. The reaction mixture was incubated at 45° C for 4 
minutes followed by the addition of 10 \xl of 200 U/jil 
RNase H~reverse transcriptase (GIBCO BRL) . The efficiency 

20 of the first strand synthesis was analyzed in a parallel 
reaction by the addition of 10 ^Ci of 32 P-adCTP to a 10 \il 
aliquot of the reaction mixture to label the reaction for 
analysis. The reactions were incubated at 45° C for 1 hour 
followed by an incubation at 50° C for 15 minutes. 

25 Unincorporated 32 P-adCTP in the labeled reaction was 
removed by chromatography on a 400 pore size gel 
filtration column (CHROMA SPIN + TE-400™; Clontech 
Laboratories Inc.). Unincorporated nucleotides in the 
unlabeled first strand reaction were removed by twice 

30 precipitating the cDNA in the presence of 10 |ig of 

glycogen carrier, 2.5 M ammonium acetate and 2.5 volume 
ethanol. The unlabeled cDNA was resuspended in 50 \xl water 
for use in second strand synthesis. The length of the 
labeled first strand cDNA was determined by agarose gel 

35 electrophoresis . 
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Second strand synthesis was performed on first 
strand cDNA under conditions that promoted first strand 
priming of second strand synthesis resulting in DNA 
hairpin formation. The reaction mixture was assembled at 

5 room temperature and was comprised of 66 jil of the 
unlabeled first strand cDNA, 20 fil of 5X polymerase I 
buffer (100 mM Tris: HCl, pH 7.4, 500 mM KCl, 25 mM MgCl 2 , 
50 mM (NH 4 ) 2 S0 4 ), 1 jil of 100 mM DTT, 1 fil of a solution 
containing 20 mM of each deoxynucleotide triphosphate, 3 n 

10 1 of 5 mM p-NAD, of 4 U/|il of E. coli DNA ligase (New 

England Biolabs Inc., Beverly, MA) and 5 \il of 10 u/jil E. 
coli DNA polymerase I (New England Biolabs, Inc.) . The 
reaction was . incubated at room temperature for 5 minutes 
followed by the addition of 2 (il of 2.2 U/\il RNase H 

15 (GIBCO BRL) . A parallel reaction in which a 10 ^1 aliquot 
of the second strand synthesis mixture was labeled by the 
addition of 10 jiCi 32 P-ocdCTP was used to monitor the 
efficiency of second strand synthesis. The reactions were 
incubated at 15° C for two hours followed by a 15 minute 

20 incubation at room temperature. Unincorporated 32 P-adCTP in 
the labeled reaction was removed by chromatography through 
a 400 pore size gel filtration column (Clontech 
Laboratories, Inc.) before analysis by agarose gel 
electrophoresis. The unlabeled reaction was terminated by 

25 two extractions with phenol/chloroform and a chloroform 
extraction followed by ethanol precipitation in the 
presence of 2.5 M ammonium acetate. 

The single- stranded DNA of the hairpin structure 
30 was cleaved using mung bean nuclease. The reaction mixture 
contained 100 jil of second strand cDNA, 20 \il of lOx mung 
bean nuclease buffer (Stratagene Cloning Systems, La 
Jolla, CA), 16 \il of 100 mM DTT, 51.5 nl of water and 12.5 
Hi of a 1:10 dilution of mung bean nuclease (Promega 
35 Corp.; final concentration 10.5 U/p.1) in mung bean 

nuclease dilution buffer. The reaction was incubated at 37° 
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C for 15 minutes. The reaction was terminated by the 
addition of 20 (il of 1 M Tris: HCl, pH 8.0 followed by 
sequential phenol/chloroform and chloroform extractions as 
described above. Following the extractions, the DNA was 
5 precipitated in ethanol and resuspended in water. 

The resuspended cDNA was blunt -ended with T4 DNA 
polymerase. The cDNA, which was resuspended in 138 \il of 
water, was mixed with 4 0 ^1 of 5X T4 DNA polymerase buffer 

10 (250 mM Tris: HCl, pH 8.0, 250 mM KC1, 25 mM MgCl 2 ) , 3 \il 
0.1 M DTT, 5 |xl of a solution containing 10 mM of each 
deoxynucleotide triphosphate and 4 ^1 of 1 U/)il T4 DNA 
polymerase (Boehringer Mannheim Corp., Indianapolis, IN). 
After incubation of 1 hour at 10° C, the reaction was 

15 terminated by the addition of 10 [il of 0.5 M EDTA 

followed by serial phenol /chloroform and chloroform 
extractions as described above. The DNA was 
chromatographed through a 400 pore size gel filtration 
column (Clontech Laboratories Inc. Palo Alto, CA) to 

20 remove trace levels of protein and to remove short cDNAs 
less than about 4 00 bp in length. The DNA was ethanol 
precipitated in the presence of 12 (ig glycogen carrier and 
2.5 M ammonium acetate and was resuspended in 10 \il of 
water. Based on the incorporation of 32 P-adCTP, the yield 

25 of cDNA was estimated to be about 2 fig from a starting 
template of 12.5 |ig. 

Eco RI adapters were ligated onto the 5 1 ends of 
the cDNA to enable cloning into a lambda phage vector. A 

30 10 nl aliquot of cDNA (containing about 2 ^g of cDNA) and 
11 ^1 of 65 pmole/|il of Eco RI adapter (Pharmacia LKB 
Biotechnology Inc.) were mixed with 3 \il lOx ligase buffer 
(Promega Corp.), 3 (il 10 mM ATP and 3 \il of 15 U/|al T4 DNA 
ligase (Promega Corp.). The reaction was incubated 

35 overnight (about 18 hours) at 12.5° C. The reaction was 

terminated by the addition of 150 fil of water and 10 of 
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3 M Na acetate, followed by incubation at 65° C for 30 
minutes. After incubation, the cDNA was extracted with 
phenol /chloroform and chloroform as described above and 
precipitated in the presence of 2.5 M ammonium acetate and 
5 1.2 volume of isopropanol . Following centrif ugation, the 
cDNA pellet was washed with 70% ethanol, air dried and 
resuspended in 89 fil water. 

To facilitate the directional cloning of the 
10 cDNA into a lambda phage vector, the cDNA was digested 
with Sst-I resulting in a cDNA having 5' Eco RI and 3' 
Sst-I cohesive ends. The Sst-I restriction site at the 3' 
end of the cDNA had been previously introduced through 
primer ZC2938 (SEQ ID NO: 12). Restriction enzyme 
15 digestion was carried out in a reaction containing 89 ^1 
of cDNA described above, 10 |xl of 6 mM Tris: HCl, 6 mM 
MgCl 2 , 150 mM NaCl, 1 mM DTT (lOx D buffer; Promega Corp., 
Madison, WI) and 1 nl of 12 U/*il Not I (Promega Corp.) . 
Digestion was carried out at 37° C for 1 hour. The reaction 
20 was terminated by serial phenol /chloroform and chloroform 
extractions. The cDNA was ethanol precipitated, washed 
with 70% ethanol, air dried and resuspended in 20 jil of lx 
gel loading buffer (10 mM Tris: HCl, pH 8.0, 1 mM EDTA, 5% 
glycerol and 0.125% bromphenol blue). 

25 

The resuspended cDNA was heated to 65° C for 5 
minutes, cooled on ice and electrophoresed on a 0.8% low 
melt agarose gel ( SEA PLAQUE GTG™ low melt agarose; FMC 
Corp.). Unincorporated adapters and cDNA below 1.6 kb in 

30 length were excised from the gel. The electrodes were 
reversed, and the cDNA was electrophoresed until 
concentrated near the lane of origin. The area of the gel 
containing the concentrated cDNA was excised and placed in 
a microfuge tube, and the approximate volume of the gel 

35 slice was determined. A 300 ^1 aliquot of water, 

approximately three times the volume of the gel slice, was 
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added to the tube. The agarose was then melted by heating 
to 65° C for 15 minutes. Following equilibration of the 
sample to 42° C, 10 |xl of 1 U/nl 0-agarose I ( New England 
Biolabs, Inc.) was added, and the mixture was incubated 
5 for 90 minutes to digest the agarose. After incubation, 40 
|il of 3 M Na acetate was added to the sample, and the 
mixture was incubated on ice for 15 minutes. The sample 
was centrifuged at 14,000 x g for 15 minutes at room 
temperature to remove undigested agarose. The cDNA in the 
10 supernatant was ethanol precipitated, washed in 70% 

ethanol, air-dried and resuspended in 37 \il of water for 
the kinase reaction to phosphorylate the ligated Bco RI 
adapters . 

15 To the 37 nl cDNA solution described above was 

added 10 \xl of lOx ligase buffer (Stratagene Cloning 
Systems) , and the mixture was heated to 65° C for 5 
minutes. The mixture was cooled on ice, and 5 \xl of 10 mM 
ATP and 3 (il of 10 U/jal of T4 polynucleotide kinase 

20 (Stratagene Cloning Systems) were added. The reaction was 
incubated at 3 7° C for 45 minutes and was terminated by 
heating to 65° C for 10 minutes followed by serial 
extractions with phenol /chloroform and chloroform. The 
phosphorylated cDNA was ethanol precipitated in the 

25 presence of 2.5 M ammonium acetate, washed with 70% 

ethanol, air dried and resuspended in 12.5 ^1 water. The 
concentration of the phosphorylated cDNA was estimated to 
be about 40 f mole/jil . 
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Example 3 

Northern Blot Analysis of Human ZcvtorS 

A 300bp double stranded DNA probe for Northern 
5 analysis was prepared from pSL1034 by PCR using 

oligonucleotide primers ZC 10,787 (SEQ ID NO: 13) and ZC 
11,097 (SEQ ID NO:14). The 300 bp PCR fragment was gel- 
purified using a QIAQUICK® purification kit (Qiagen Inc., 
Chatsworth, CA) and random-primer labeled using a 

10 MULT I PRIME® kit (Amersham Corp.). Labeled cDNA was 

purified from free counts using a Stratagene push column. 
Human multiple tissue Northern blots (Clontech 
Laboratories) and a human fetal tissue Northern blot 
(Clontech Laboratories) were pre-hybridized for three 

15 hours at 68 °C using EXPRESSHYB hybridization solution 
(Clontech Laboratories) . The 32 P- labeled cDNA probe was 
then added to 10 mis of fresh hybridization solution at 
10 6 cpm/ml overnight at 68°C. The blots were washed 
several times at room temperature in wash solution 

20 containing 2X SSC, 0.05% SDS, then with continuous 

agitation for 40 min at room temperature. The blots were 
then washed in 0.1X SSC, 0.1% SDS at 50 °C for 40 min with 
one change of wash solution. 

25 A single transcript of ~2.3 kb was detected 

after exposure to film. In the multiple tissue blots (MTN, 
MTN II and MTN III; Clontech Laboratories) the transcript 
was present in highest abundance in placenta, thyroid, 
heart and skeletal muscle with lower levels in prostate 

30 and trachea. Trace mRNA levels were found in kidney, 
pancreas, testis, small intestine, colon, lymph node, 
adrenal cortex and bone marrow. 
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Example 4 

Chromosomal Assignment and Placement of Human Zcvtor-5 

ZcytorS was mapped to chromosome 19 using the 
5 commercially available version of the "Stanford G3 
Radiation Hybrid Mapping Panel" (Research Genetics, Inc., 
Huntsville, AL) . The "Stanford G3 RH Panel" contains 
PCRable DNAs from each of 83 radiation hybrid clones of 
the whole human genome, plus two control DNAs (the RM 
10 donor and the A3 recipient) . A publicly available WWW 
server (http: //shgc- www.stanford.edu) allows chromosomal 
localization of markers. 

For the mapping of ZcytorS with the "Stanford G3 RH 

15 Panel", 20 /xl reactions were set up in a PCRable 96 -well 
microtiter plate (Stratagene, La Jolla, CA) and used in a 
"RoboCycler Gradient 96" thermal cycler (Stratagene) . Each 
of the 85 PGR reactions consisted of 2 /zl 10X KlenTaq PCR 
reaction buffer (CLONTECH Laboratories, Inc., Palo Alto, 

20 CA) , 1.6 fil dNTPs mix (2.5 mM each, PERKIN- ELMER, Foster 
City, CA) , 1 fil sense primer, (SEQ ID N0:13) 5' TAT GGC 
CAG GAC AAC ACA 3', 1 jil antisense primer, (SEQ ID NO:14), 
5' ATA GGG CGT AAA GAG AGC 3', 2 fxl "RediLoad" (Research 
Genetics, Inc., Huntsville, AL) , 0.4 ^1 50X Advantage 

25 KlenTaq Polymerase Mix (Clontech Laboratories, Inc.), 25 
ng of DNA from an individual hybrid clone or control and x 
111 ddH20 for a total volume of 20 /il. The reactions were 
overlaid with an equal amount of mineral oil and sealed. 
The PCR cycler conditions were as follows: an initial 1 

30 cycle 5 minute denaturation at 95°C, 35 cycles of a 1 
minute denaturation at 95°C / 1 minute annealing at 66°C and 
1.5 minute extension at 72°C, followed by a final 1 cycle 
extension of 7 minutes at 72°C. The reactions were 
separated by electrophoresis on a 2% agarose gel (Life 

35 Technologies, Gaithersburg, MD) . 
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The results showed linkage of ZcytorS to the 
framework marker WI-7289 with a LOD score of >10 and at a 
distance of 14 . 67cR_10000 from the marker. The use of 
surrounding markers positions ZcytorS in the 19pl3.1-pll 
5 region on the integrated LDB chromosome 19 map (The 
Genetic Location Database, University of Southhampton, WWW 
server: http: //cedar .genetics, soton.ac.uk/public_html/) . 

Example 5 
10 Cloning of the Rat ZcvtorS Gene 

Rat ZcytorS cDNA encoding ZcytorS was 
isolated from an amplified Rat testis cDNA library with a 
probe that was generated by primers ZC12212 (SEQ ID NO: 

15 11) and ZC10785 (SEQ ID N0:15) and 10 ng of plasmid 

pSL85212 as a template obtained from cDNA containing EST 
698365 as described in Example 1. The probe was prepared 
by PCR by combining 1 |il containing 10 ng of pSL85212, 1 
(il of ZC12212 having a concentration of 20 pmole/fil, ljal 

20 of ZC10785 having a concentration of 20 pmole/jil, 0.5|il of 
dNTP having a concentration of 20 mM of dATP, dGTP, dCTP 
and dTTP, 5 (al of 10X Klentaq polymerase buffer (Clontech) 
5nl Klentaq DNA polymerase (Clontech) and 39.5^1 water. 
The amplification was carried out at 94° C for 1 minute 

25 followed by 30 cycles, each cycle consisting of 15 seconds 
at 95° C, 20 seconds at 62° C and 1 minutes at 68° C. The 
reaction had a final incubation at 68° C for 10 minutes. 

The resulting PCR product was diluted 1:100 with 
30 water. Four ^tl of the diluted PCR product was re-amplified 
using the above -described conditions and the resultant PCR 
product was further purified by electrophoresis on low- 
melt agarose gel. The DNA probe was recovered from low- 
melt gel by digestion with P-Agarose I digestion. The rat 
35 ZcytorS gene was then cloned from a rat testis library 
which was constructed as described below in Example 6 . 
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In cloning the rat ZcytorS gene, the library was 
first amplified by plating 3.10 6 plaque forming units (pfu) 
from the previously constructed primary library onto 98 
5 150 mm NZY plates. Ten ml of serum medium was added to 
each plate and was incubated for several hours at room 
temperature. Following incubation, the phage lysates were 
collected and pooled to yield the amplified phage library. 

10 1.5 million pfus from the amplified rat testis 

cDNA library were plated onto 150 mm NZY plates at a 
density of 40,000 pfu/plate on XL-1 Blue MRF» host cells. 
Following incubation at 37° C overnight, filter lifts were 
made using HYBOND-N™ membranes (Amersham) , according to 

15 the procedures provided by the manufacturer. The filters 
were processed by denaturation in solution containing 1.5 
M NaCl and 0.5 M NaOH for 8 minutes at room temperature. 
The filters were neutralized in 0.5 M Tris: HCl, pH 7.2 
for 5 minutes. Phage DNA was fixed onto the filters with 

20 1,200 jaJoules of UV energy in a UV Cross-linker 

(Stratagene) . The filters were then washed with 0.25X SSC 
at 70° C to remove excess cellular debris. Filter pre- 
hybridization was carried out in a hybridization solution 
containing 5X SSC, 5X Denhardt solution, 0.2% SDS, 1 mM 

25 EDTA and heat denatured sheared salmon- sperm DNA at a 
final concentration of 100 H-g/ml for 72 hours at 60° C. 

75 ng of probe DNA was labeled with 32 P-dCTP 
using a MEGAPRIME® labeling kit (Amersham) and was 

30 purified with a NUCTRAP® column (Stratagene) . The labeled 
probe was heat -denatured and added to fresh hybridization 
solution at a concentration of 1.5 x 10 s cpm/ml. Into this 
solution were also added the filters containing the phage 
particles. Hybridization of the probes to the phage- 

35 containing filters was completed overnight at 4 5 c C. 
Following hybridization, the filters were washed in a 
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solution containing 0.25X SSC, 0.25% SDS and 1 mM EDTA at 
50° C. The washed filters were autoradiographed for 72 
hours at -70° C with intensifying screens. Examination of 
the autoradiographs revealed multiple regions that 
5 hybridized with the labeled probe. Agar plugs were picked 
from 56 regions for plaque purification. Of the positive 
signals, eleven produce positive phagemids following 
secondary and tertiary hybridization screens. The plasmids 
within the positive phagemids were recovered using the 
10 EXASSIT/SOLR™ system according to the manufacturer's 
specifications. A clone designated pSLRatR5-l was 
sequenced and found to encode full length Rat ZcytorB (SEQ 
ID NO: 5) 

15 

Example 6 

Production of Rat Testis cDNA library 

The rat first strand cDNA reaction contained 10 

20 nl of rat testis poly d (T) -selected poly (A) + mRNA 

(Clontech, Palo Alto, CA) at a concentration of 1 . 0 |ag/|il, 
and 2 |il of 20 pmole/|al first strand primer ZC6091 (SEQ ID 
NO: 16) containing an Xho I restriction site. The mixture 
was heated at 70°C for 4 minutes and cooled by chilling on 

25 ice. First strand cDNA synthesis was initiated by the 
addition of 8 jil of first strand buffer (5x SUPERSCRIPT™ 
buffer; Life Technologies, Gaithersburg, MD) , 4 nl of 100 
mM dithiothreitol, and 2 jil of a deoxynucleotide 
triphosphate solution containing 10 mM each of dATP, dGTP 

30 and 5 -methyl -dCTP (Pharmacia LKB Biotechnology, 

Piscataway, NJ) to the RNA-primer mixture. The reaction 
mixture was incubated at 45° C for 2 minutes, followed by 
the addition of 10 ^1 of 200 U/fil RNase H- reverse 
transcriptase (SUPERSCRIPT II®/' Life Technologies) . The 

35 efficiency of the first strand synthesis was analyzed in a 
parallel reaction by the addition of 10 jiCi of 32 P-ccdCTP 
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to 5 nl aliquot from one of the reaction mixtures to label 
the reaction for analysis. The reactions were incubated 
at 45°C for 1 hour followed by an incubation at 50° C for 
10 minutes. Unincorporated 32 P-ccdCTP in the labeled 
5 reaction was removed by chromatography on a 400 pore size 
gel filtration column (Clontech) . The unincorporated 
nucleotides and primers in the unlabeled first strand 
retains were removed by chromatography on 4 00 pore size 
gel filtration column (Clontech) . The length of labeled 
10 first strand cDNA was determined by agarose gel 
electrophoresis . 

The second strand reaction contained 102 \il of 
the unlabeled first strand cDNA, 30 |ul of 5x polymerase I 

15 buffer (125 mM Tris: HC1, pH 7.5 # 500 mM KC1, 25 mM 

MgCl 2 , 50mM (NH 4 ) 2 S0 4)>> 2 Ml of 100 mM dithiothreitol , 3 
Hi of a solution containing 10 mM of each deoxynucleotide 
triphosphate, 5 >il of 5 mM P-NAD, 2 nl of 3 U/^l E. coli 
DNA ligase (New England Biolabs) , 5 [il of 10 U/|U E. coli 

20 DNA polymerase I (New England Biolabs), and 1.5 jal of 2 U/ 
|il RNase H (Life Technologies) . A 10 \il aliquot from one 
of the second strand synthesis reactions was labeled by 
the addition of 10 nCi 32 P-ctdCTP to monitor the efficiency 
of second strand synthesis. The reactions were incubated 

25 at 16° C for two hours, followed by the addition of 10 |il 
T4 DNA polymerase (10 U/jil, Boerhinger Mannheim) and 
incubated for an additional 5 minutes at 16°C. 
Unincorporated 32 P-adCTP in the labeled reaction was 
removed by chromatography through a 4 00 pore size gel 

30 filtration column (Clontech) before analysis by agarose 
gel electrophoresis. The unlabeled was terminated by the 
addition of 20 nl 0.5 EDTA and extraction with phenol/ 
chloroform and chloroform followed by ethanol 
precipitation in the presence of 2.5 M ammonium acetate. 

35 The yield of cDNA was estimated to be approximately 2 \ig 
from starting mRNA template of 10 ng. 
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Eco RI adapters were ligated onto the 5' ends of 
the cDNA described above to enable cloning into an 
expression vector. A 10.5 ^il aliquot of cDNA C2 \xg) and 
5 5 |il of 65 pmole/nl of Eco RI adapter (Pharmacia LKB 
Biotechnology Inc.) were mixed with 2.5 nl lOx ligase 
buffer 66 mM Tris-HCl pH 7,5, 10 mM MgCl 2/ 2.5 ^1 of 10 mM 
ATP and 1 \il of 15 U/jil T4 DNA ligase (Promega Corp., 
Madison, WI) . The reaction was incubated overnight (~12 

10 hours) at 12°C. The reaction was terminated by incubation 
at 70°C for 20 minutes. After incubation, the reaction 
was cooled to 37°C. To the reaction was added 2 . 5 |xl lOmM 
ATP and 3 fil 10 U/^l T4 polynucleotide kinase (Life 
Technologies) to phosphorylate the ligated Eco RI 

15 adapters . 

To facilitate the directional cloning of the 
cDNA into an expression vector, the cDNA was digested with 
Xho I, resulting in a cDNA having a 5' Eco RI cohesive end 

20 and a 3 f Xho I cohesive end. The Xho I restriction site 
at the 3 ' end of the cDNA had been previously introduced 
using the ZC6091 primer (SEQ ID NO: 3). Restriction 
enzyme digestion was carried out in a reaction mixture 
containing 25 \xl of cDNA described above, 15 \xl of lOx H 

25 Buffer (Boehringer Mannheim), 109 \il H 2 0, and 1.0 nl of 40 
V/\il Xho I (Boehringer Mannheim) . Digestion was carried 
out at 37°C for 40 minutes. The reaction was terminated by 
incubation at 65° C for 10 minutes and chromatography 
through a 400 pore size gel filtration column (Clontech) . 

30 

The cDNA was ethanol precipitated, washed with 
70% ethanol, air dried and resuspended in 20 ^il of lx gel 
loading buffer (10 mM Tris:HCl, pH 8.0, 1 mM EDTA, 5% 
glycerol and 0.125% bromphenol blue) . The resuspended 
35 cDNA was heated to 65° C for 5 minutes, cooled on ice and 
electrophoresed on a 0.8% low melt agarose gel. The 
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contaminating adapters and cDNA below 0.6 Kb in length 
were excised from the gel. The electrodes were reversed, 
and the cDNA was electrophoresed until concentrated near 
the lane origin. The area of the gel containing the 
5 concentrated cDNA was excised and placed in a microfuge 
tube, and the approximate volume of the gel slice was 
determined. An aliquot of water approximately three times 
the volume of the gel slice (300 \xl) and 35 ^1 lOx p- 
agarose I buffer (New England Biolabs) was added to the 

10 tube, and the agarose was melted by heating to 65°C for 15 
minutes. Following equilibration of the sample to 45°C, 
3 |al of 1 U/jal p-agarose I (New England Biolabs) was 
added, and the mixture was incubated for 60 minutes at 4 5°C 
to digest the agarose. After incubation, 40 fil of 3 M Na 

15 acetate was added to the sample, and the mixture was 
incubated on ice for 15 minutes. The sample was 
centrifuged at 14,000 x g for 15 minutes at room 
temperature to remove undigested agarose. The cDNA was 
ethanol precipitated, washed in 70% ethanol, air-dried and 

20 resuspended in 10 (il water. 

The resulting cDNA was cloned into the lambda 
phage vector XZapII that was predigested with Eco RI and 
Xho I and dephosphorylated (Stratagene Cloning Systems, La 

25 Jolla, CA) . Ligation of the cDNA to the A.ZapII vector was 
carried out in a reaction mixture containing 1.0 |il of 
prepared vector, 1.0 \il of rat testis cDNA, 1 . 0 [il 10X 
Ligase Buffer (Promega) , 1.0 |il of 10 mM ATP, 5 ^1 water, 
and 1.0 |il of T4 DNA Ligase at 15 units/ml (Promega). The 

30 ligation mixture was incubated at 5°C-15°C overnight in a 
temperature gradient. After incubation, the ligation 
mixture was packaged into phage using GIGPACK III GOLD 
packaging extract (Stratagene Cloning Systems) and the 
resulting library was titered according to the 

35 manufacturer's specifications. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: ZymoGenetics, Inc. 

1201 Eastlake Avenue East 

Seattle 

WA 

USA 

98102 

(ii) TITLE OF THE INVENTION: MAMMALIAN ZCYT0R5 



(iii) NUMBER OF SEQUENCES: 37 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Zymogenetics 

(B) STREET: 1201 Eastlake Ave East 

(C) CITY: Seattle 

(D) STATE: WA 

(E) COUNTRY: USA 

(F) ZIP: 98102 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Lunn. Paul G 

(B) REGISTRATION NUMBER: 32.743 
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(0 REFERENCE/DOCKET NUMBER: 96-22PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 206-442-6627 

(B) TELEFAX: 206-442-6678 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52... 1317 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

CGCCACGCGC CGAGCCGCAG CCCGCCGCGC GCCCCCGGCA GCGCCGGCCC C ATG CCC 57 

Met Pro 
1 



GCC GGC 


CGC 


CGG 


GGC 


CCC 


GCC 


GCC 


CAA 


TCC 


GCG 


CGG 


CGG 


CCG 


CCG 


CCG 


105 


Ala Gly 


Arg 


Arg 


Gly 


Pro 


Ala 


Ala 


Gin 


Ser 


Ala 


Arg 


Arg 


Pro 


Pro 


Pro 






5 










10 










15 










TTG CTG 


CCC 


CTG 


CTG 


CTG 


CTG 


CTC 


TGC 


GTC 


CTC 


GGG 


GCG 


CCG 


CGA 


GCC 


153 


Leu Leu 


Pro 


Leu 


Leu 


Leu 


Leu 


Leu 


Cys 


Val 


Leu 


Gly 


Ala 


Pro 


Arg 


Ala 




20 










25 










30 










GGA TCA 


GGA 


GCC 


CAC 


ACA 


GCT 


GTG 


ATC 


AGT 


CCC 


CAG 


GAT 


CCC 


ACG 


CTT 


201 


Gly Ser 


Gly 


Ala 


His 


Thr 


Ala 


Val 


He 


Ser 


Pro 


Gin 


Asp 


Pro 


Thr 


Leu 




35 








40 










45 








50 




CTC ATC 


GGC 


TCC 


TCC 


CTG 


CTG 


GCC 


ACC 


TGC 


TCA 


GTG 


CAC 


GGA 


GAC 


CCA 


249 


Leu He 


Gly 


Ser 


Ser 


Leu 


Leu 


Ala 


Thr 


Cys 


Ser 


Val 


His 


Gly 


Asp 


Pro 










55 










60 








65 
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CCA GGA GCC ACC GCC GAG GGC CTC TAC TGG ACC CTC AAT GGG CGC CGC 297 
Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg 
70 75 80 

CTG CCC CCT GAG CTC TCC CGT GTA CTC AAC GCC TCC ACC TTG GCT CTG 345 
Leu Pro Pro Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu 
85 90 95 

GCC CTG GCC AAC CTC AAT GGG TCC AGG CAG CGG TCG GGG GAC AAC CTC 393 
Ala Leu Ala Asn Leu Asn Gly Ser Arg Gin Arg Ser Gly Asp Asn Leu 
100 105 110 

GTG TGC CAC GCC CGT GAC GGC AGC ATC CTG GCT GGC TCC TGC CTC TAT 441 
Val Cys His Ala Arg Asp Gly Ser He Leu Ala Gly Ser Cys Leu Tyr 
115 120 125 130 

GTT GGC CTG CCC CCA GAG AAA CCC GTC AAC ATC AGC TGC TGG TCC AAG 489 
Val Gly Leu Pro Pro Glu Lys Pro Val Asn He Ser Cys Trp Ser Lys 
135 140 145 

AAC ATG AAG GAC TTG ACC TGC CGC TGG ACG CCA GGG GCC CAC GGG GAG 537 
Asn Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu 
150 155 160 

ACC TTC CTC CAC ACC AAC TAC TCC CTC AAG TAC AAG CTT AGG TGG TAT 585 
Thr Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr 
165 170 175 

GGC CAG GAC AAC ACA TGT GAG GAG TAC CAC ACA GTG GGG CCC CAC TCC 633 
Gly Gin Asp Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser 
180 185 190 

TGC CAC ATC CCC AAG GAC CTG GCT CTC TTT ACG CCC TAT GAG ATC TGG 681 
Cys His He Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu He Trp 
195 200 205 210 

GTG GAG GCC ACC AAC CGC CTG GGC TCT GCC CGC TCC GAT GTA CTC ACG 729 
Val Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr 
215 220 225 

CTG GAT ATC CTG GAT GTG GTG ACC ACG GAC CCC CCG CCC GAA GTG CAC 777 
Leu Asp He Leu Asp Val Val Thr Thr Asp Pro Pro Pro Glu Val His 
230 235 240 
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GTG A6C CGC GTC GGG GGC CTG GAG GAC CAG CTG AGC GTG CGC TGG GTG 825 
Val Ser Arg Val Gly Gly Leu Glu Asp Gin Leu Ser Val Arg Trp Val 
245 250 255 

TCG CCA CCC GCC CTC AAG GAT TTC CTC TTT CAA GCC AAA TAC CAG ATC 873 
Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gin Ala Lys Tyr Gin He 
260 265 270 

CGC TAC CGA GTG GAG GAC AGT GTG GAC TGG AAG GTG GTG GAC GAT GTG 921 
Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val 
275 280 285 290 

AGC AAC CAG ACC TCC TGC CGC CTG GCC GGC CTG AAA CCC GGC ACC GTG 969 
Ser Asn Gin Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val 
295 300 305 

TAC TTC GTG CAA GTG CGC TGC AAC CCC TTT GGC ATC TAT GGC TCC AAG 1017 
Tyr Phe Val Gin Val Arg Cys Asn Pro Phe Gly He Tyr Gly Ser Lys 
310 315 320 

AAA GCC GGG ATC TGG AGT GAG TGG AGC CAC CCC ACA GCC GCC TCC ACT 1065 
Lys Ala Gly He Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr 
325 330 335 

CCC CGC AGT GAG CGC CCG GGC CCG GGC GGC GGG GCG TGC GAA CCG CGG 1113 
Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Ala Cys Glu Pro Arg 
340 345 350 

GGC GGA GAG CCG AGC TCG GGG CCG GTG CGG CGC GAG CTC AAG CAG TTC 1161 
Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gin Phe 
355 360 365 370 

CTG GGC TGG CTC AAG AAG CAC GCG TAC TGC TCC AAC CTC AGC TTC CGC 1209 
Leu Gly Trp Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg 
375 380 385 

CTC TAC GAC CAG TGG CGA GCC TGG ATG CAG AAG TCG CAC AAG ACC CGC 1257 
Leu Tyr Asp Gin Trp Arg Ala Trp Met Gin Lys Ser His Lys Thr Arg 
390 395 400 

AAC CAG GAC GAG GGG ATC CTG CCC TCG GGC AGA CGG GGC ACG GCG AGA 1305 
Asn Gin Asp Glu Gly He Leu Pro Ser Gly Arg Arg Gly Thr Ala Arg 
405 410 415 
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GGT CCT GCC AGA TAAGCTGTAG GGGCTCAGGC CACCCTCCCT GCCACGTGGA GACGC 1362 
Gly Pro Ala Arg 
420 

AGAGGCCGAA CCCAAACTGG GGCCACCTCT GTACCCTCAC TTCAGGGCAC CTGAGCCACC 
CTCAGCAAGA GCTGGGGTGG CCCCTGAGCT CCAACGGCCA TAACAGCTCT GACTCCCACG 
TGAGGCCACC TTTGGGTGCA CCCCAGTGGG TGTGTGTGTG TGTGTGAGGG TTGGTTGAGT 
TGCCTAGAAC CCCTGCCAGG GCTGGGGGTG AGAAGGGGAG TCATTACTCC CCATTACCTA 
GGGCCCCTCC AAAAGAGTCC TTTTAAATAA ATGAGCTATT TAGGTGCAAA AAAAAAAAAA 
AAAAAAAAAT TGCCCTCGTG CCGAATTC 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 



Met 


Pro 


Ala 


Gly 


Arg 


Arg 


Gly 


Pro 


Ala 


Ala 


Gin 


Ser Ala Arg Arg Pro 


1 








5 










10 




15 


Pro 


Pro 


Leu 


Leu 


Pro 


Leu 


Leu 


Leu 


Leu 


Leu 


Cys 


Val Leu Gly Ala Pro 








20 










25 






30 


Arg 


Ala 


Gly 


Ser 


Gly 


Ala 


His 


Thr 


Ala 


Val 


He 


Ser Pro Gin Asp Pro 






35 










40 








45 


Thr 


Leu 


Leu 


He 


Gly 


Ser 


Ser 


Leu 


Leu 


Ala 


Thr 


Cys Ser Val His Gly 




50 










55 










60 


Asp 


Pro 


Pro 


Gly 


Ala 


Thr 


Ala 


Glu 


Gly 


Leu 


Tyr 


Trp Thr Leu Asn Gly 


65 










70 










75 


80 


Arg 


Arg 


Leu 


Pro 


Pro 


Glu 


Leu 


Ser 


Arg 


Val 


Leu 


Asn Ala Ser Thr Leu 










85 










90 




95 


Ala 


Leu 


Ala 


Leu 


Ala 


Asn 


Leu 


Asn 


Gly 


Ser 


Arg 


Gin Arg Ser Gly Asp 








100 










105 






110 


Asn 


Leu 


Val 


Cys 


His 


Ala 


Arg 


Asp 


Gly 


Ser 


He 


Leu Ala Gly Ser Cys 






115 










120 








125 


Leu 


Tyr 


Val 


Gly 


Leu 


Pro 


Pro 


Glu 


Lys 


Pro 


Val 


Asn He Ser Cys Trp 




130 










135 










140 


Ser 


Lys 


Asn 


Met 


Lys 


Asp 


Leu 


Thr 


Cys 


Arg 


Trp 


Thr Pro Gly Ala His 


145 










150 










155 


160 



1422 
1482 
1542 
1602 
1662 
1690 
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Gly 


Glu 


Thr 


Phe 


Leu 
165 


His 


Thr 


Asn 


Tvr 


Spr 1 pij 1 vs Tvr 1 vc 
170 


1 pi i Arn 
LCU Ml y 

17R 


Trp 


Tyr 


Gly 


Gin 
180 


Asp 


Asn 


Thr 


Cvs 


Glu 
185 


Glu Tvr His Thr Val 

UIU I J \ 1 I 1 O III! VQ 1 

190 


Gl v Prn 
U 1 y rl u 


His 


Ser 


Cys 


His 


He 


Pro 


Lvs 

LJO 


Asn 

no |j 


1 PIJ 


Ala 1 pi j Php Thr Prn 
niu lcu riic tin ri u 


Tvr Gin 
i jr 1 UlU 






195 










200 




one 


He 


Tpd 


Val 


Glu 


Ala 


Thr 


Asn 

nj 1 1 


Ara 

r\i y 


I PIJ 
LCU 


Glv ^pr Ala Arn 9or 
u i jr oci nia Ml y oci 


Acn Val 
Mop Vd 1 




210 










215 






220 


Leu 


Thr 


Leu 


Asp 


He 


1 PU 


Asn 


Val 


Val 

v a I 


Thr Thr Acn Prn Prn 
i mi i hi Mop r [ u r i u 


Prn ftln 

rl U UlU 


225 










230 








235 


240 


Val 


His 


Val 


Ser 


Ara 
245 


Val 


Glv 
u ly 


Glv 
u iy 


1 Pi l 
LcU 


Gill Acn £ln 1 aw Qor 
UlU Mop Oil) LcU oci 

C\J\J 


Val Arn 

va i Arg 


Tpd 
i i \j 


Val 


Ser 


Pro 


Prn 
i i \j 


Ala 


1 pi i 


1 vc 
Lyo 


Acn 
Mop 


Php 1 on Dr.P ftln Ala 

rile LcU rile Uin Mia 


Lys Tyr 








260 










265 


270 


Gin 


He 


Ara 


Tvr 


Aro 
ry\ y 


Val 


Glu 


Acn 
Mop 


Cop 
OCI 


Val Acn Trn 1 v/c \/al 

vd i asp i rp Lys va i 


Arr\ 

va 1 Asp 






275 










280 




285 


Asp 


Val 


Ser 


Asn 


Gin 

Will 


Thr 
1 1 ii 


Spr 


Tvc 


Arn 
mi y 


1 on Ala fZl w 1 on 1 \/c 

Leu Mia uiy Leu Lys 


Qr*r\ PI w 

rro uiy 




290 
















^nn 

ouu 


Thr 


Val 


Tvr 


Phe 


Val 

V U 1 


Gin 


Val 


Arn 
mi y 


Pvc 

Lyo 


Acn Prn Php fll v Tlo 
Mori r r u rile d 1 jr lie 


Twr PI w 

iyr uiy 


305 










310 








315 


320 


Ser 


Lvs 


Lvs 


Ala 


Glv 
325 


1 1 C 


Trn 
I r p 


Spr 

OCI 


Gin 

U 1 U 


Trn Qoi" Hi c Drn Thr 
lip oci nlo riU 1 nr 


Ala Ala 

a i a ai a 

OoO 


Ser 


Thr 


Pro 


Arg 
340 


^»pr 


Glu 


Am 


Prn 


G1 v 
u \y 

345 


Prn Glv ft! v Glv Ala 
r I U Uljr u iy Uiy Mid 

350 


Pvc Pill 

tys u i u 


Pro 


Arg 


Glv 


Glv 

u \y 


Glu 


Prn 


OCI 


OCI 


u ly 


Prn Val Arn Arn HIm 
rl U vdl Ml y Ml y UlU 


Leu Lys 






355 










360 




365 


Gin 


Phe 
370 


t PI J 


Glv 
u iy 


Trn 
I r p 


1 PI l 
LCD 


1 v/c 

Lys 
375 


Lys 


Mi c 

n I o 


Al a Twr P \/c Co» Arn 

Mia iyr Lys oer Asn 
380 


Leu Ser 


Phe 


Arg 


Leu 


Tyr 


Asp 


Gin 


Trp 


Arg 


Ala 


Trp Met Gin Lys Ser 


His Lys 


385 










390 








395 


400 


Thr 


Arg 


Asn 


Gin 


Asp 
405 


Glu 


Gly 


He 


Leu 


Pro Ser Gly Arg Arg 
410 


Gly Thr 
415 


Ala 


Arg 


Gly 


Pro 
420 


Ala 


Arg 













(2) INFORMATION FOR SEQ ID NO: 3: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 
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(A) NAME/ KEY: Coding Sequence 

(B) LOCATION: 88... 1362 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 



AATTCGGCAC GAGGGGCCTG CGTCCCGCGC CGTGCGCCAC CGCCGCCGAG CCGCAGCCCG 60 
CCGCGCGCCC CCGGCAGCGC CGGCCCC ATG CCC GCC GGC CGC CGG GGC CCC GCC 114 

Met Pro Ala Gly Arg Arg Gly Pro Ala 
1 5 



GCC CAA TCC GCG CGG CGG CCG CCG CCG TTG CTG CCC CTG CTG CTG CTC 162 
Ala Gin Ser Ala Arg Arg Pro Pro Pro Leu Leu Pro Leu Leu Leu Leu 
10 15 20 25 

TGC GTC CTC GGG GCG CCG CGA GCC GGA TCA GGA GCC CAC ACA GCT GTG 210 
Cys Val Leu Gly Ala Pro Arg Ala Gly Ser Gly Ala His Thr Ala Val 
30 35 40 

ATC AGT CCC CAG GAT CCC ACG CTT CTC ATC GGC TCC TCC CTG CTG GCC 258 
He Ser Pro Gin Asp Pro Thr Leu Leu He Gly Ser Ser Leu Leu Ala 
45 50 55 

ACC TGC TCA GTG CAC GGA GAC CCA CCA GGA GCC ACC GCC GAG GGC CTC 306 
Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala Thr Ala Glu Gly Leu 
60 65 70 

TAC TGG ACC CTC AAT GGG CGC CGC CTG CCC CCT GAG CTC TCC CGT GTA 354 
Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro Glu Leu Ser Arg Val 
75 80 85 

CTC AAC GCC TCC ACC TTG GCT CTG GCC CTG GCC AAC CTC AAT GGG TCC 402 
Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala Asn Leu Asn Gly Ser 
90 95 100 105 

AGG CAG CGG TCG GGG GAC AAC CTC GTG TGC CAC GCC CGT GAC GGC AGC 450 
Arg Gin Arg Ser Gly Asp Asn Leu Val Cys His Ala Arg Asp Gly Ser 
110 115 120 

ATC CTG GCT GGC TCC TGC CTC TAT GTT GGC CTG CCC CCA GAG AAA CCC 498 
He Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu Pro Pro Glu Lys Pro 
125 130 135 
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GTC AAC ATC AGC TGC TGG TCC AAG AAC ATG AAG GAC TTG ACC TGC CGC 546 
Val Asn He Ser Cys Trp Ser Lys Asn Met Lys Asp Leu Thr Cys Arg 
140 145 150 

TGG ACG CCA GGG GCC CAC GGG GAG ACC TTC CTC CAC ACC AAC TAC TCC 594 
Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu His Thr Asn Tyr Ser 
155 160 165 

CTC AAG TAC AAG CTT AGG TGG TAT GGC CAG GAC AAC ACA TGT GAG GAG 642 
Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gin Asp Asn Thr Cys Glu Glu 
170 175 180 185 

TAC CAC ACA GTG GGG CCC CAC TCC TGC CAC ATC CCC AAG GAC CTG GCT 690 
Tyr His Thr Val Gly Pro His Ser Cys His He Pro Lys Asp Leu Ala 
190 195 200 

CTC TTT ACG CCC TAT GAG ATC TGG GTG GAG GCC ACC AAC CGC CTG GGC 738 
Leu Phe Thr Pro Tyr Glu He Trp Val Glu Ala Thr Asn Arg Leu Gly 
205 210 215 

TCT GCC CGC TCC GAT GTA CTC ACG CTG GAT ATC CTG GAT GTG GTG ACC 786 
Ser Ala Arg Ser Asp Val Leu Thr Leu Asp He Leu Asp Val Val Thr 
220 225 230 

ACG GAC CCC CCG CCC GAC GTG CAC GTG AGC CGC GTC GGG GGC CTG GAG 834 
Thr Asp Pro Pro Pro Asp Val His Val Ser Arg Val Gly Gly Leu Glu 
235 240 245 

GAC CAG CTG AGC GTG CGC TGG GTG TCG CCA CCC GCC CTC AAG GAT TTC 882 
Asp Gin Leu Ser Val Arg Trp Val Ser Pro Pro Ala Leu Lys Asp Phe 
250 255 260 265 

CTC TTT CAA GCC AAA TAC CAG ATC CGC TAC CGA GTG GAG GAC AGT GTG 930 
Leu Phe Gin Ala Lys Tyr Gin He Arg Tyr Arg Val Glu Asp Ser Val 
270 275 280 

GAC TGG AAG GTG GTG GAC GAT GTG AGC AAC CAG ACC TCC TGC CGC CTG 978 
Asp Trp Lys Val Val Asp Asp Val Ser Asn Gin Thr Ser Cys Arg Leu 
285 290 295 

GCC GGC CTG AAA CCC GGC ACC GTG TAC TTC GTG CAA GTG CGC TGC AAC 1026 
Ala Gly Leu Lys Pro Gly Thr Val Tyr Phe Val Gin Val Arg Cys Asn 
300 305 310 
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CCC TTT GGC ATC TAT GGC TCC AAG AAA GCC GGG ATC TGG AGT GAG TGG 1074 
Pro Phe Gly He Tyr Gly Ser Lys Lys Ala Gly He Trp Ser Glu Trp 
315 320 325 

AGC CAC CCC ACA GCC GCC TCC ACT CCC CGC AGT GAG CGC CCG GGC CCG 1122 
Ser His Pro Thr Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro 
330 335 340 345 

GGC GGC GGG GCG TGC GAA CCG CGG GGC GGA GAG CCG AGC TCG GGG CCG 1170 
Gly Gly Gly Ala Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro 
350 355 360 

GTG CGG CGC GAG CTC AAG CAG TTC CTG GGC TGG CTC AAG AAG CAC GCG 1218 
Val Arg Arg Glu Leu Lys Gin Phe Leu Gly Trp Leu Lys Lys His Ala 
365 370 375 

TAC TGC TCC AAC CTC AGC TTC CGC CTC TAC GAC CAG TGG CGA GCC TGG 1266 
Tyr Cys Ser Asn Leu Ser Phe Arg Leu Tyr Asp Gin Trp Arg Ala Trp 
380 385 390 

ATG CAG AAG TCG CAC AAG ACC CGC AAC CAG CAC AGG ACG AGG GGA TCC 1314 
Met Gin Lys Ser His Lys Thr Arg Asn Gin His Arg Thr Arg Gly Ser 
395 400 405 

TGC CCT CGG GCA GAC GGG GCA CGG CGA GAG GTC CTG CCA GAT AAG CTG T 1363 
Cys Pro Arg Ala Asp Gly Ala Arg Arg Glu Val Leu Pro Asp Lys Leu 
410 415 420 425 

AGGGGCTCAG GCCACCCTCC CTGCCACGTG GAGACGCAGA GGCCGAACCC AAACTGGGGC 1423 

CACCTCTGTA CCCTCACTTC AGGGCACCTG AGCCACCCTC AGCAGGAGCT GGGGTGGCCC 1483 

CTGAGCTCCA ACGGCCATAA CAGCTCTGAC TCCCACGTGA GGCCACCTTT GGGTGCACCC 1543 

CAGTGGGTGT GTGTGTGTGT GTGAGGGTTG GTTGAGTTGC CTAGAACCCC TGCCAGGGCT 1603 

GGGGGTGAGA AGGGGAGTCA TTACTCCCCA TTACCTAGGG CCCCTCCAAA AGAGTCCTTT 1663 

TAAATAAATG AGCTATTTAG GTGCAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1723 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1783 

AAAAAAAAAA AAAAAAAAAT TTCCCGGGGA 1813 



(2) INFORMATION FOR SEQ ID NO:4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 



Met 


Pro 


Ala 


Glv 


Ara 

rw y 


n\ y 


Glv 

u i y 


Prn 
ri u 


Ala 
M 1 a 


Ala 

M 1 d 


Gin 

u III 


Qpr* Ala A Vti Arn Drn 

ocf Mia Mry Mrg rro 


i 
± 








f; 

J 










1U 




lo 


Pro 


Prn 


i pi j 

LCU 


1 PI 1 


Prn 


1 pi i 
LcU 


t pi i 
LcU 


1 Ol 1 

LcU 


Leu 


Lys 


Val 
Va 1 


Leu biy Aia rro Arg 








C\) 










oc 

CD 






JO 


Ala 


gi v 

u iy 


Qpr 


Glv 
u iy 


Ala 
Ml d 


Hi c 
n 1 o 


i nr 


Al a 

A i a 


va i 


T 1 a 

l le 


Ser 


rro bin Asp rro (nr 






uD 










4U 








45 


1 pi i 


Leu 


T1p 

1 1 c 


Glv 
u i y 






Leu 


Leu 


ai a 


i nr 


Cue 

tys 


ber va i his b ly Asp 




OU 










DO 










oU 


Pro 


Prn 


Glv 
LJ iy 


Ala 


Thr 
1 1 ii 


Ala 

r\ 1 CI 


Glu 
U 1 u 


Gl V 

u iy 


1 Ol 1 

LcU 


lyr 


irp 


Th Iqii Arw nTi* Awn 

inr Leu Asn biy Arg 


fin 










/ U 










/o 


80 


n\ y 


1 PI 1 


Prn 


if u 


Gin 


t Pi i 
LcU 


Caw 
JCI 


Arg 


va i 


Leu 


Asn 


Aia ber inr Leu Ala 










DO 










on 




95 


LcU 


Ala 
M 1 a 


1 pi i 

LcU 


Al A 

M 1 a 


Asn 


LcU 


Asn 


b iy 


Ser 


Arg 


bin 


Arg Ser Gly Asp Asn 








i nn 

1UU 










i nc 






110 




Val 
va I 




n i o 


Al a 
M 1 a 


Mr y 


A c r\ 


gi w 
b iy 


Ser 


i le 


Leu 


Ala bly ber Lys Leu 






110 










i on 
1^1) 








i or 

125 


Tvr 
iyr 


Val 
v a i 


Gl V 


1 PI 1 
LcU 


Prn 
r i U 


Prn 
rr U 


Gill 

u 1 U 


Lys 


rTO 


\/al 

va i 


Asn 


lie Ser Cys Trp Ser 














1 ^ 

loo 










14U 


Lys 


Asn 


Met 


1 v^ 


A^n 


1 PIJ 

L^U 


Thr 
N 1 1 




Arn 
mi y 


Trn 
irp 


Thr 

Mil 


Prn Glv Ala Wi c Glu 
rr u oiy nia nio uiy 


145 










i^n 










1 

100 


10U 


Glu 


Thr 


Phe 


t PIJ 


His 

1 1 1 o 


Thr 


A<;n 

riO 1 1 


Tvr 

iyr 


OCI 


1 pi i 

LCU 


Lyb 


T\/r 1 v/c 1 pi i Arn Trr* 

\yi Lyi LcU mi y irp 










J.UiJ 










1 70 

1 / U 




1 7R 
1/D 


Tvr 


Gly 


Gin 


Asp 


Asn 


Thr 


f v<; 


Glu 


Glu 

VJ 1 u 


Tvr 


Mi 


Thr Mai Glv Prn Hie 
nn vai u iy rr u nio 








-LOU 










100 






i on 
iyu 


Ser 


Cvs 


His 


Tip 

l 1 c 


Prn 

r i u 


1 v^ 
Lyb 


Acn 


1 PI 1 
LCU 


Ala 

M 1 d 


1 pi i 

LcU 


Php 
r Tie 


Thr Drn Tvr Gin Tip 

i iir rr u iyr n i u lie 






19n 


















^UD 


Trn 


Val 


Glu 


Ala 


Thr 


A^n 


Arn 


1 PI 1 
LCU 


Glv 
u iy 


OCT 


Ala 


Arn ^pr Acn \/al 1 pi i 
Mi y oci Mbp Vd 1 LcU 




210 










215 










ecu 


Thr 


Leu 


Asp 


He 


Leu 


Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro Pro Pro Asp Val 


225 










230 










235 


240 


His 


Val 


Ser 


Arg 


Val 


Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


Leu Ser Val Arg Trp 










245 










250 




255 


Val 


Ser 


Pro 


Pro 


Ala 


Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin Ala Lys Tyr Gin 








260 










265 






270 


He 


Arg 


Tyr 


Arg 


Val 


Glu 


Asp 


Ser 


Val 


Asp 


Trp 


Lys Val Val Asp Asp 






275 










280 








285 


Val 


Ser 


Asn 


Gin 


Thr 


Ser 


Cys 


Arg 


Leu 


Ala 


Gly 


Leu Lys Pro Gly Thr 




290 










295 










300 


Val 


Tyr 


Phe 


Val 


Gin 


Val 


Arg 


Cys 


Asn 


Pro 


Phe 


Gly He Tyr Gly Ser 


305 










310 










315 


320 
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Lys 


Lys 


Ala 


Gly 


He 


TrD 


Ser 


Glu 


Tro 


Ser His Prn Thr Ala 

wJt.i I M o ll u 1111 nig 


Ala Qor 
Mid ot?f 










325 










330 




Thr 


Pro 


Arg 


Ser 


Glu 


Arg 


Pro 


Gly 


Pro 


Glv Gly Glv Ala Cvs 










340 










345 


350 




Arg 


Gly 


Gly 


Glu 


Pro 


Ser 


Ser 


Gly 


Pro 


Val Ara Ara Glu Lpu 


1 vs Gin 

Ujf o ulll 






355 










360 






Phe 


Leu 


Gly 


Tro 


Leu 


Lvs 


Lvs 


His 


Ala 


Tvr fv^ ^pp Am 1 on 

'J' OCI Moll LcU 






370 










375 






380 




Arg 


Leu 


Tyr 


Asp 


Gin 


Trp 


Arg 


Ala 


Trp 


Met Gin Lys Ser His 


Lys Thr 


385 










390 








395 


400 


Arg 


Asn 


Gin 


His 


Arg 


Thr 


Arg 


Gly 


Ser 


Cys Pro Arg Ala Asp 


Gly Ala 










405 










410 


415 


Arg 


Arg 


Glu 


Val 


Leu 


Pro 


Asp 


Lys 


Leu 












420 










425 







(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 159... 1433 
(0) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

GAATTCGGCA CGAGGAATTT CGGCTGCTCA GACTTGCTCC GGCCTTCGCT GTCCGCGCCC 60 
AGTGACGTGC GTGCGGACCC AAACCCCAAT CTGCACCCCG CAGAGTCGCC CCCGCCCCAT 120 
ACCGGCGTTG CAGTCACCGC CCGTTGCGCG CCACCCCC ATG CCC GCC GGT GGC CCG 176 

Met Pro Ala Gly Gly Pro 
1 5 

GGC CCC GCC GCC CAA TCC GCG CGG CGG CCG CCG CGG CGG CTC TCC TCG 224 
Gly Pro Ala Ala Gin Ser Ala Arg Arg Pro Pro Arg Arg Leu Ser Ser 
10 15 20 

CTG TGG TCG CCT CTG TTG CTC TGT GTT CTC GGG GTG CCT CAG GGC GGA 272 
Leu Trp Ser Pro Leu Leu Leu Cys Val Leu Gly Val Pro Gin Gly Gly 
25 30 35 
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TCG GGA GCC CAC ACA GCT GTG ATC AGT CCC CAG GAC CCC ACT CTT CTC 320 
Ser 61y Ala His Thr Ala Val lie Ser Pro Gin Asp Pro Thr Leu Leu 
40 45 50 

ATC GGA TCC TCC CTT CAT GCT ACG TGC TCT ATA CAT GGA GAC ACA CCG 368 
He Gly Ser Ser Leu His Ala Thr Cys Ser He His Gly Asp Thr Pro 
55 60 65 70 

GGG GCC ACT GCT GAG GGC CTC TAC TGG ACC CTC AAC GGC CGC CGC CTG 416 
Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu 
75 80 85 

CCC TCA GAG CTG TCC CGT CTC CTC AAC ACC TCC ACC CTG GCC TTG GCC 464 
Pro Ser Glu Leu Ser Arg Leu Leu Asn Thr Ser Thr Leu Ala Leu Ala 
90 95 100 

CTG GCT AAC CTT AAT GGG TCC AGG CAG CAG TCA GGG GAC AAT CTG GTG 512 
Leu Ala Asn Leu Asn Gly Ser Arg Gin Gin Ser Gly Asp Asn Leu Val 
105 110 115 

TGT CAC GCC CGA GAT GGC AGC ATT CTG GCT GGT TCC TGC CTC TAT GTT 560 
Cys His Ala Arg Asp Gly Ser He Leu Ala Gly Ser Cys Leu Tyr Val 
120 125 130 

GGT CTG CCC CCG GAG AAG CCC TTT AAC ATC AGC TGC TGG TCC CGG AAC 608 
Gly Leu Pro Pro Glu Lys Pro Phe Asn He Ser Cys Trp Ser Arg Asn 
135 140 145 150 

ATG AAG GAC CTG ACA TGC CGT TGG ACA CCG GGT GCA CAT GGG GAG ACA 656 
Met Lys Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr 
155 160 165 

TTC CTA CAC ACC AAC TAC TCC CTC AAG TAC AAG CTG AGG TGG TAT GGT 704 
Phe Leu His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly 
170 175 180 

CAG GAC AAC ACA TGT GAG GAA TAT CAC ACT GTG GGC CCT CAC TCG TGC 752 
Gin Asp Asn Thr Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys 
185 190 195 

CAT ATC CCC AAA GAC CTG GCC CTC TTC ACG CCC TAT GAG ATC TGG GTG 800 
His He Pro Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu He Trp Val 
200 205 210 
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GAA GCC ACC AAT CGC CTG GGT TCA GCG AGA TCT GAC GTG CTC ACA CTG 848 
Glu Ala Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu 
215 220 225 230 

GAT GTC CTG GAC GTG GTG ACC ACG GAC CCT CCA CCC GAC GTG CAC GTG 896 
Asp Val Leu Asp Val Val Thr Thr Asp Pro Pro Pro Asp Val His Val 
235 240 245 

AGC CGC GTT GGG GGC CTG GAG GAC CAG CTG AGT GTG CGC TGG GTC TCA 944 
Ser Arg Val Gly Gly Leu Glu Asp Gin Leu Ser Val Arg Trp Val Ser 
250 255 250 

CCA CCA GCT CTC AAG GAT TTC CTC TTC CAA GCC AAA TAC CAG ATT CGC 992 
Pro Pro Ala Leu Lys Asp Phe Leu Phe Gin Ala Lys Tyr Gin He Arg 
265 270 275 

TAC CGC GTG GAG GAC AGC GTG GAC TGG AAG GTG GTG GAT GAC GTC AGC 1040 
Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val Ser 
280 285 290 

AAC CAG ACC TCC TGC.CGT CTC GCG GGC TTG AAG CCC GGC ACC GTT TAC 1088 
Asn Gin Thr Ser Cys Arg Leu Ala Gly Leu Lys Pro Gly Thr Val Tyr 
295 300 305 310 

TTC GTC CAA GTT CGT TGT AAC CCA TTC GGG ATC TAT GGG TCG AAA AAG 1136 
Phe Val Gin Val Arg Cys Asn Pro Phe Gly He Tyr Gly Ser Lys Lys 
315 320 325 

GCG GGA ATC TGG AGC GAG TGG AGC CAC CCC ACC GCT GCC TCC ACC CCT 1184 
Ala Gly He Trp Ser Glu Trp Ser His Pro Thr Ala Ala Ser Thr Pro 
330 335 340 

CGA AGT GAG CGC CCG GGC CCG GGC GGC GGG GTG TGC GAG CCG CGG GGC 1232 
Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Val Cys Glu Pro Arg Gly 
345 350 355 

GGC GAG CCT AGC TCG GGC CCG GTG CGG CGC GAG CTC AAG CAG TTC CTC 1280 
Gly Glu Pro Ser Ser Gly Pro Val Arg Arg Glu Leu Lys Gin Phe Leu 
360 365 370 

GGC TGG CTC AAG AAG CAC GCG TAC TGC TCG AAC CTT AGC TTC CGC CTG 1328 
Gly Trp Leu Lys Lys His Ala Tyr Cys Ser Asn Leu Ser Phe Arg Leu 
375 380 385 390 
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TAC GAC CAG TGG CGT GCT TGG ATG CAG AAG TCA CAC AAG ACC CGA AAC 1376 
Tyr Asp Gin Trp Arg Ala Trp Met Gin Lys Ser His Lys Thr Arg Asn 
395 400 405 

CAG GAC GAG GGG ATC CTG CCC TCG GGC AGA CGG GGT GCG GCG AGA GGT 1424 
Gin Asp Glu Gly He Leu Pro Ser Gly Arg Arg Gly Ala Ala Arg Gly 
410 415 420 

CCT GCC GGC TAAACTCTGA GGATAGGCCA TCCTCCTGCT GGATGCAGAC CTGGAGGCT 1482 
Pro Ala Gly 
425 

CACCTGAACT GGAGACCATC TGTACTGTCA CTTTGGGGCA ATGAAGAAAC AAACCAGGGG 
CTGGGGCACA ATGAGCTCCC ACAACCACAG CTTTGGCCAC ATGATGGTCA ACTTTGGATG 
TACCCCAATA TGGGTAGGGT TGGAGTAATG ACAAGGGTTA TGCAGGACCC TCCAAGAGTC 
TCTTTGAATA AATAAGAAAA GAGTTGTTCA GGAAAAAAAA AAAAAAAAAA AATAGCGGCC 
GC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met 


Pro 


Ala 


Gly 


Gly 


Pro 


Gly 


Pro 


Ala 


Ala 


Gin 


Ser Ala Arg Arg Pro 


1 








5 










10 




15 


Pro 


Arg 


Arg 


Leu 


Ser 


Ser 


Leu 


Trp 


Ser 


Pro 


Leu 


Leu Leu Cys Val Leu 








20 










25 






30 


Gly 


Val 


Pro 


Gin 


Gly 


Gly 


Ser 


Gly 


Ala 


His 


Thr 


Ala Val He Ser Pro 






35 










40 








45 


Gin 


Asp 


Pro 


Thr 


Leu 


Leu 


He 


Gly 


Ser 


Ser 


Leu 


His Ala Thr Cys Ser 




50 










55 










60 


He 


His 


Gly 


Asp 


Thr 


Pro 


Gly 


Ala 


Thr 


Ala 


Glu 


Gly Leu Tyr Trp Thr 


65 










70 










75 


80 


Leu 


Asn 


Gly 


Arg 


Arg 


Leu 


Pro 


Ser 


Glu 


Leu 


Ser 


Arg Leu Leu Asn Thr 










85 










90 




95 


Ser 


Thr 


Leu 


Ala 


Leu 


Ala 


Leu 


Ala 


Asn 


Leu 


Asn 


Gly Ser Arg Gin Gin 








100 










105 






110 



1542 
1602 
1662 
1722 
1724 
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Ser Gly 


Asp 


Asn 


Leu 


Val 


Tvs 


His 


Ala 

r\ i a 


Arn Acn 
Ml y Mop 


&1 v/ Qor Tip 1 on Ala 
toiy oci lie LcU Mid 




115 


















Gly Ser 


Cvs 


i pi j 


Tvr 


Val 

V u 1 


gi v 

u ly 


1 PI 1 
LcU 


Prn 
ri u 


Prn PI ii 
ri u to 1 U 


1 \/c D/'A Dho Acn T 1 s\ 

Lys rro rne Msn lie 


130 










1^5 








1 Af\ 




Trn 


Spr 


Arn 
rM y 


Acn 


lie L 


1 \/c 

Lys 


Acn 
nop 


1 £11 1 TK r 

LcU III! 


Lys Mrg irp inr rro 


145 








150 








100 


IDU 


Glv Ala 

VI 1 Jr PI I a 


His 


Glv 
u iy 


Gill 
U 1 u 


Thr 
I fir 


rile 


LcU 


n 1 o 


Tin n A c* ki 

inr MoH 


Twn Caw 1 /mi 1 v / /— "T"\ < 

iyr oer Leu Lys lyr 








165 










170 


I/O 


i v<\ i pi j 


Ara 


Trn 


T\/r 

iyr 


b ly 


Gin 

va i n 


Acr\ 


Acn 
MoM 


inr uys 


Pill Pill T\/^ U-i r TUia 

biu biu iyr His inr 






180 










Ift5 

100 






Val Glv 

v a i vj i jr 


Prn 


Hie 


Qpr 
oci 


Pvc 

Lyo 


Mi c 


Tlo 

1 1 e 


Drn 

rTO 


Lys Asp 


Leu Ala Leu rne inr 




195 


















Prn Tvr 


Rln 

U 1 LI 


T1p 
l l c 


irp 


va 1 


u 1 U 


AT a 

A I a 


i nr 


Asn Arg 


Leu Gly Ser Ala Arg 


210 

C X \J 










£10 








lcSj 


Cap Acn 
Oct nofJ 


Val 
va i 


1 Ol 1 

LcU 


Thr 


1 Ol 1 

LcU 


Asp 


va I 


Leu 


Asp va ! 


Va I T nr Thr Asp Pro 


225 


















240 


Prn Prn 
n u ri u 


Acn 


Val 
va i 


Mi c 
n i o 


Va 1 


oer 


Arg 


Wal 

va i 


PI \/ PT w 

oiy b ly 


Leu biu Asp bin Leu 








245 

t.*tO 










CDXJ 




Ser Val 


Arn 
ni y 


Trn 


Val 
vd I 


Cor 1 
JCI 


Prn 
r f 0 


rro 


Al a 

a i a 


Leu Lys 


a i>i>\ nt%ss i i i nUn pi ,« 

Asp me Leu rne bin 






260 










£00 




97fl 


Ala Lvs 


Tvr 


Gin 

Hill 


T1p 
1 1 c 


Ann 

mi y 


T 

lyr 


Arg 


va i 


P"|ii A rn 

tolU ASp 


Ser Val Asp Trp Lys 




275 










9ft0 






COD 


Val Val 

V U 1 V □ 1 


Acn 

nip 


Acn 
Mop 


Val 
vd I 


Cop 

OCI 


M5H 


u i n 


Thr 

i nr 


Ser Cys 


Arg Leu Ala biy Leu 


290 










£:?0 








JUU 


1 v^ Pro 


Glv 
u iy 


Thr 


Val 
vd I 


lyr 


Php 
rile 


Vd 1 


to i n 


\/aT Arn 

va i Mrg 


Pi/p Acn D^a Dk/> PI w 

Lys Asn rro rne biy 


305 








^10 








^1 5 
010 


'ion 


He Tvr 
1 1 v. 


Glv 

VI I J 


Ser 


1 vc 


1 vc 
Lyo 


Ala 


Gl V 

u iy 


T1p 

1 1 c 


Trri Cor 
1 1 p JCI 


Plii 1 rr\ Con UH c Dn/\ 

biu irp oer his rro 








325 










^?0 


^5 
000 


Thr Ala 


Ala 


Ser 


Thr 

III! 


Prn 


Arn 
r\l y 


Cpt 
oci 


Gill 
to I u 


Arn Drn 
Ml y r 1 U 


PI \/ Orn Pl\/ PI v/ PI \/ 

biy rro biy biy biy 






340 














^50 
oou 


Val Cvs 


Glu 

VI 1 u 


Prn 

1 1 VJ 


Aro 
mi y 


Glv 
vi ly 


Gl V 

u iy 


to l u 


Drn 

riO 


Car Cop 


biy rro vai Arg Arg 




355 










OOU 






ODD 


Glu 1 pu 

VI 1 U LCD 


1 vs 
Lyo 


Gin 


Php 
rile 


1 Ol 1 


P] w 

u iy 


Tpr\ 

irp 


Leu 


Lys Lys 


mis Aia iyr uys oer 


370 










375 








380 


Asn Leu 


Ser 


Phe 


Arg 


Leu 


Tyr 


Asp 


Gin 


Trp Arg 


Ala Trp Met Gin Lys 


385 








390 








395 


400 


Ser His 


Lys 


Thr 


Arg 


Asn 


Gin 


Asp 


Glu 


Gly He 


Leu Pro Ser Gly Arg 








405 










410 


415 


Arg Gly 


Ala 


Ala 


Arg 


Gly 


Pro 


Ala 


Gly 










420 










425 







(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 259 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AGGATTTCCT CTTTCAAGCC AAATACCAGA TCCGCTANCG AGTGGAGGAN AGTGTGGANT 60 

GGAAGGTGGT GGANGATGTG AGCAACCAGA CCTTCTGCCG CTGGNCGGCC TGAAACCCGG 120 

CANCGTGTAC TTCGTGCAAG TGCGCTGCAA NCCCTTTGGC ATCTATGGCT NCAAGAAAGC 180 

CGGGATCTNG AGTGAGTGGA GCCANCCCAC AGCCGGCTTC ANTTCCCGCA GTGAGCGNCN 240 

GGGCCCGGGN GGNGGGAAG 259 

(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
CGCGAGCTCA AGCAGTTCCT G 21 
(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

GNACACGCCG NTATAGCTNG CCCCTGCTGC TGCTGCTCTG CGTCCTCGGG GCNCGCGAGC 
GGATTCAGGA GCCCACACAG CTGTGATCAG TCCCCAGGAT CCCACGCTTC TCATCGGCTC 
CTCCCTGCTG GCCACCTGCT CAGTGCACGG AGACCCACCA GGAGCCACCG CCGAGGGCCT 
CTACTGGACC CTCAACGGGC GCCGCTGCCC 



60 
120 
180 
210 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: Other 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:10: 
CCACCTAAGC TTGTACTTGA GG 22 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
AGCCCACACA GCTGTGATCA G 21 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 
GACAGAGCAC AGAATTCACT AGTGAGCTCT llllllllll TTTT 44 
(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
TATGGCCAGG ACAACACA 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14 
ATAGGGCGTA AAGAGAGC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TCACATCGTC CACCACCTTC CAGTCCA 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(0) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 
GA6CACAGAA TTCACTACTC GAGGCGGCCG Clllllllll lllllllll 49 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Gly 


Ser 


Gly 


Ala 


His 


Thr 


Ala 


Val 


He 


Ser 


Pro 


Gin 


Asp 


Pro Thr Leu 


1 








5 










10 






15 


Leu 


He 


Gly 


Ser 


Ser 


Leu 


Leu 


Ala 


Thr 


Cys 


Ser 


Val 


His 


Gly Asp Pro 








20 










25 










30 


Pro 


Gly 


Ala 


Thr 


Ala 


Glu 


Gly 


Leu 


Tyr 


Trp 


Thr 


Leu 


Xaa 


Gly Arg Arg 






35 










40 










45 


Leu 


Pro 


Pro 


Glu 


Leu 


Ser 


Arg 


Val 


Leu 


Asn 


Ala 


Ser 


Thr 


Leu Ala Leu 




50 










55 










60 






Ala 


Leu 


Ala 


Asn 


Leu 


Asn 


Gly 


Ser 


Arg 


Gin 


Arg 


Ser 


Gly 


Asp Asn Leu 


65 










70 










75 






80 


Val 


Cys 


His 


Ala 


Arg 


Asp 


Gly 


Ser 


He 


Leu 


Ala 


Gly 


Ser 


Cys Leu Tyr 










85 










90 








95 


Val 


Gly 


Leu 


Pro 


Pro 


Glu 


Lys 


Pro 


Val 


Asn 


He 


Ser 


Cys 


Trp Ser Lys 








100 










105 










110 


Asn 


Met 


Lys 


Asp 


Leu 


Thr 


Cys 


Arg 


Trp 


Thr 


Pro 


Gly 


Ala 


His Gly Glu 






115 










120 










125 




Thr 


Phe 


Leu 


His 


Thr 


Asn 


Tyr 


Ser 


Leu 


Lys 


Tyr 


Lys 


Leu 


Arg. Trp Tyr 




130 










135 










140 




Gly 


Gin 


Asp 


Asn 


Thr 


Cys 


Glu 


Glu 


Tyr 


His 


Thr 


Val 


Gly 


Pro His Ser 


145 










150 










155 




160 


Cys 


His 


He 


Pro 


Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Thr 


Pro 


Tyr 


Glu He Trp 










165 










170 








175 


Val 


Glu 


Ala 


Thr 


Asn 


Arg 


Leu 


Gly 


Ser 


Ala 


Arg 


Ser 


Asp 


Val Leu Thr 



180 185 190 
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Leu 


Asp 


He 


Leu 


Asp 


Val 


Val 

¥ U 1 


Thr 

Mil 


Thr 
1 1 ii 


Acn Prn Prn Prn 
mo p r i u ri u n u 


Glu Val 
u i u val 


Hi c 
n i o 






195 










200 

C—\J\J 




205 






Val 


Ser 


Ara 


Val 


Gly 


Glv 


1 PIJ 


Glu 


A^n 


Gin Lpu ^pr Val 


Arn Trn 
mi y iip 


Val 
va i 




210 










215 






220 




Ser 


Pro 


Pro 


Ala 


Leu 


Lyo 


nop 


Php 


1 PI I 


Php Gl n Ala 1 v<: 
ri ic u 1 1 1 Mid Ljro 


Tvr Gin 
lyr u 1 1 1 


T1p 
1 1 e 


225 


















9^ 
C.OD 






Ara 

ru y 


Tvr 


Ara 

ni y 


Val 


Glu 

\J 1 LI 


nop 


Cop 
Oct 


Val 
Vd l 


Mop 


Trn 1 vc \/al \/a1 
1 1 p Ljro val Val 


Mbp Mop 


vd 1 










245 
















Ser 


Asn 


Gin 

VJ 111 


Thr 
1 1 u 


^pr 




mi y 


1 01 1 


Al A 


aly LcU Ljo r 1 U 


Glu Thr 

u iy i nr 


Vd 1 








260 










lUJ 




970 
c/v 




Tvr 


Phe 


Val 

v a i 


Gin 


Val 

v a i 


Arn 


P vq 


Acn 
MM 1 


Prn 
ri U 


Php ft! v Tie* Tvr 
rric u iy 1 1 c iy r 


PI w Cor. 

b iy oer 


1 \1C 

Lys 






275 










2ft0 




COD 






Lvs 

LJO 


Ala 


Gl v 

u iy 


Tip 


Trn 
1 f p 


OCf 


Gin 


irp 


JCI 


Hi c Prn Thr Ala 
mo rr 0 1 rir M 1 a 


AT 

Ala Ser 


i nr 




290 
















OUU 






Pro 


Arn 

r\i y 


Spr 

OCT 1 


Glu 


Arn 


Prn 
ri u 


Gl\/ 
u iy 


r I U 


b iy 


m \/ m \/ A "In Pv/e 

b i y b i y a i a uys 


D r>/-\ 

blU rro 


Arg 


305 










7i n 








010 




oc\) 


Gl v 
a iy 


u iy 


Gill 


Prn 
ri U 


oer 


oer 


b iy 


rro 


va i 


Arts'* A O ") « » 1 nil 

Arg Arg biu Leu 


Lys bin 


Phe 




















ooU 


335 




Leu 


b iy 


Trp 


Leu 


Lys 


Lys 


HIS 


Ala 


Tyr 


Cys Ser Asn Leu 


Ser Phe 


Arg 








340 










345 




350 


Leu 


Tyr 


Asp 


Gin 


Trp 


Arg 


Ala 


Trp 


Met 


Gin Lys Ser His 


Lys Thr 


Arg 






355 










360 




365 




Asn 


Gin 


Asp 


Glu 


Gly 


He 


Leu 


Pro 


Ser 


Gly Arg Arg Gly 


Thr Ala 


Arg 




370 










375 






380 






Gly 


Pro 


Ala 


Arg 


















385 

























(2) INFORMATION FOR SEQ 10 N0:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Gly Ser Gly Ala His Thr Ala Val He Ser Pro Gin Asp Pro Thr Leu 
1 5 10 15 

Leu He Gly Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro 

20 25 30 

Pro Gly Ala Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg 
35 40 45 
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Leu Pro Pro Glu Leu Ser 
50 

Ala Leu Ala Asn 
65 

Val Cys His Ala 



Val Gly Leu Pro 
100 

Asn Met Lys Asp 
115 

Thr Phe Leu His 
130 

Gly Gin Asp Asn 
145 

Cys His He Pro 

Val Glu Ala Thr 
180 

Leu Asp lie Leu 
195 

Val Ser Arg Val 
210 

Ser Pro Pro Ala 
225 

Arg Tyr Arg Val 

Ser Asn Gin Thr 
260 

Tyr Phe Val Gin 
275 

Lys Ala Gly He 
290 

Pro Arg Ser Glu 
305 

Gly Gly Glu Pro 

Leu Gly Trp Leu 
340 

Leu Tyr Asp Gin 
355 

Asn Gin His Arg 
370 

Arg Glu Val Leu 
385 



Leu Asn 
70 

Arg Asp 
85 

Pro Glu 



Leu Thr 

Thr Asn 

Thr Cys 
150 
Lys Asp 
165 

Asn Arg 

Asp Val 

Gly Gly 

Leu Lys 
230 
Glu Asp 
245 

Ser Cys 

Val Arg 

Trp Ser 

Arg Pro 
310 
Ser Ser 
325 

Lys Lys 

Trp Arg 

Thr Arg 

Pro Asp 
390 



Arg Val 
55 

Gly Ser 

Gly Ser 

Lys Pro 

Cys Arg 
120 
Tyr Ser 
135 

Glu Glu 

Leu Ala 

Leu Gly 

Val Thr 
200 

Leu Glu 
215 

Asp Phe 

Ser Val 

Arg Leu 

Cys Asn 
280 
Glu Trp 
295 

Gly Pro 

Gly Pro 

His Ala 

Ala Trp 
360 
Gly Ser 
375 

Lys Leu 



Leu Asn 

Arg Gin 

He Leu 
90 

Val Asn 
105 

Trp Thr 

Leu Lys 

Tyr His 

Leu Phe 
170 
Ser Ala 
185 

Thr Asp 

Asp Gin 

Leu Phe 

Asp Trp 
250 
Ala Gly 
265 

Pro Phe 

Ser His 

Gly Gly 

Val Arg 
330 
Tyr Cys 
345 

Met Gin 



Ala Ser 
60 

Arg Ser 
75 

Ala Gly 

He Ser 

Pro Gly 

Tyr Lys 
140 
Thr Val 
155 

Thr Pro 

Arg Ser 

Pro Pro 

Leu Ser 
220 
Gin Ala 
235 

Lys Val 

Leu Lys 

Gly He 

Pro Thr 
300 
Gly Ala 
315 

Arg Glu 
Ser Asn 
Lys Ser 



Thr Leu Ala Leu 
Gly Asp 
Ser Cys 



Cys Pro Arg Ala 
380 



Cys Trp 
110 
Ala His 
125 

Leu Arg 

Gly Pro 

Tyr Glu 

Asp Val 
190 
Pro Asp 
205 

Val Arg 

Lys Tyr 

Val Asp 

Pro Gly 
270 
Tyr Gly 
285 

Ala Ala 

Cys Glu 

Leu Lys 

Leu Ser 
350 
His Lys 
365 

Asp Gly 



Asn Leu 
80 

Leu Tyr 
95 

Ser Lys 



Gly Glu 

Trp Tyr 

His Ser 
160 
He Trp 
175 

Leu Thr 

Val His 

Trp Val 

Gin He 
240 
Asp Val 
255 

Thr Val 

Ser Lys 

Ser Thr 

Pro Arg 
320 
Gin Phe 
335 

Phe Arg 
Thr Arg 
Ala Arg 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 385 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NU . iy : 




Ala His 


Thr 


Ala 


Val 


He 


Ser 


Pro 


Gin 


Asp 


rro inr 


Leu Leu lie Gly 


1 ■ 






5 










10 




lb 


Ser Ser 


Leu 


His 


Ala 


Thr 


Cys 


Ser 


He 


His 


fll v Aer\ 

uiy Asp 


inr rro biy Ala 






20 










25 








Thr Ala 


Glu 


Gly 


Leu 


Tyr 


Trp 


Thr 


Leu 


Asn 


uiy Ary 


Arg Leu rro oer 




35 










40 










Glu Leu 


Ser 


Arg 


Leu 


Leu 


Asn 


Thr 


Ser 


Thr 


1 AM A "1 -J 

LcU A 1 d 


Leu Ala Leu Ala 


50 










55 








ou 




Asn Leu 


Asn 


Gly 


Ser 


Arg 


Gin 


Gin 


Ser 


Gly 


A c r\ Ac n 
rtbp Abrl 


Leu va i Lys MIS 


65 








70 










7^ 
/ o 


ou 


Ala Arg 


Asp 


Gly 


Ser 


He 


Leu 


Ala 


Gly 


Ser 


KsJO LcU 


\yi vd i u iy Leu 








85 










90 






Pro Pro 


Glu 


Lys 


Pro 


Phe 


Asn 


He 


Ser 


Cys 


Trn Spr 


Arn A^n Mp+ 1 

ni y mom mcl Ljro 






100 










105 






110 


Asp Leu 


Thr 


Cys 


Arg 


Trp 


Thr 


Pro 


Gly 


Ala 


His Gly 


Glu Thr Php 1 pij 




115 










120 








125 


His Thr 


Asn 


Tyr 


Ser 


Leu 


Lys 


Tyr 


Lys 


Leu 


Arg Trp 


Tyr Gly Gin Asp 


130 










135 








140 


Asn Thr 


Cys 


Glu 


Glu 


Tyr 


His 


Thr 


Val 


Gly 


Pro His 


Ser Cys His He 


145 








150 










155 


160 


Pro Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Thr 


Pro 


Tyr 


Glu He 


Trp Val Glu Ala 








165 










170 




175 


Thr Asn 


Arg 


Leu 


Gly 


Ser 


Ala 


Arg 


Ser 


Asp 


Val Leu 


Thr Leu Asp Val 






180 










185 






190 


Leu Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro 


Pro 


Pro 


Asp Val 


His Val Ser Arg 




195 










200 








205 


Val Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


Leu 


Ser 


Val 


Arg Trp 


Val Ser Pro Pro 


210 










215 








220 




Ala Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin 


Ala 


Lys 


Tyr Gin 


lie Arg Tyr Arg 


225 








230 










235 


240 


Val Glu 


Asp 


Ser 


Val 


Asp 


Trp 


Lys 


Val 


Val 


Asp Asp 


Val Ser Asn Gin 








245 










250 


255 
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Thr Ser 


Cys Arg 


Leu 


Ala 


Gly 


Leu 


Lys 


Pro Gly 


Thr Val Tyr Phe Val 

llll » V* 1 1 J 1 III X^. T \Jk 1 




260 










265 




270 


Gin Val 


Arg Cys 


Asn 


Pro 


Phe 


Gly 


He 


Tyr Gly 


Ser Lys Lvs Ala Glv 




275 








280 






285 


He Trp 


Ser Glu 


Trp 


Ser 


His 


Pro 


Thr 


Ala Ala 


Ser Thr Pro Ara Spr 


290 








295 








300 


Glu Arg 


Pro Gly 


Pro 


Gly 


Gly 


Gly 


Val 


Cvs Glu 


Pro Ara Glv Glv Glu 


305 






310 








315 


320 


Pro Ser 


Ser Gly 


Pro 


Val 


Arq 


Ara 


Glu 


Leu Lys 


Gin Phe Leu Glv Trn 






325 










330 


335 


Leu Lys 


Lys His 


Ala 


Tyr 


Cys 


Ser 


Asn 


Leu Ser 


Phe Arg Leu Tyr Asp 




340 










345 




350 


Gin Trp 


Arg Ala 


Trp 


Met 


Gin 


Lys 


Ser 


His Lys 


Thr Arg Asn Gin Asp 




355 








360 






365 


Glu Gly 


He Leu 


Pro 


Ser 


Gly 


Arg 


Arg 


Gly Ala 


Ala Arg Gly Pro Ala 


370 








375 








380 



Gly 



385 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 385 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE 


TYPE: protein 








(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


N0:20: 


Ala His 


Thr Ala Val 


He Ser 


Pro 


Gin 


Asp 


Pro 


Thr Leu Leu He Gly 


1 


5 








10 




15 


Ser Ser 


Leu Leu Ala 


Thr Cys 


Ser 


Val 


His 


Gly 


Asp Pro Pro Gly Ala 




20 






25 






30 


Thr Ala 


Glu Gly Leu 


Tyr Trp 


Thr 


Leu 


Xaa 


Gly 


Arg Arg Leu Pro Pro 




35 




40 








45 


Glu Leu 


Ser Arg Val 


Leu Asn 


Ala 


Ser 


Thr 


Leu 


Ala Leu Ala Leu Ala 


50 




55 










60 


Asn Leu 


Asn Gly Ser 


Arg Gin 


Arg 


Ser 


Gly 


Asp 


Asn Leu Val Cys His 


65 




70 








75 


80 


Ala Arg 


Asp Gly Ser 


He Leu 


Ala 


Gly 


Ser 


Cys 


Leu Tyr Val Gly Leu 




85 








90 




95 


Pro Pro 


Glu Lys Pro 


Val Asn 


He 


Ser 


Cys 


Trp 


Ser Lys Asn Met Lys 




100 






105 






110 
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Asp Leu 


Thr 


Cys 


Arq 


TrD 


Thr 


Pro 


Glv 


Ala 


Hi Glv Glu Thr Php 1 pii 

1 1 1 j U 1 Jr UlU IIM rllC LCU 




115 










120 






125 


His Thr 


Asn 


Tyr 


Ser 


Leu 


Lvs 


Tvr 


1 v<; 


1 PU 


Arn Trn T\/r Gl \j Gin Acn 

Ml y II p 1 I Uljr ulfl Mop 


130 










135 








"]&() 


Asn Thr 


Cys 


Glu 


Glu 


Tvr 


His 


Thr 


Val 
va i 


Glv 
o ly 


Prn Hie Qor fvc Mi c Tlo 
ri u nib oci Ljrc) nib lie 


145 








150 












Pro Lys 


Asp 


Leu 


Ala 


Leu 


Php 


Thr 

Mil 




Tv/r 


Gin Tlo Trn £1ti Ala 
ulU lie irp vd 1 blU Mia 








165 










170 


1 7C 

1 / L) 


Thr Asn 


Arq 


Leu 


Glv 


Ser 


Ala 


Ara 


Spr 


A^n 


V^l 1 pii Thr 1 on Acn T "1 o 
va i lcu lilt Leu nip lie 






180 










185 






Leu Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro 


Pro 


Pro 


Glu VaI Hie Va! <;pr Arn 

uiu vai nib Va i jci rtry 




195 










200 






90^ 


Val Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


t Pli 

LCU 


9pp 


Val 


Arn Trn Wal Qor Drn Dr>n 
Mi y I r p Va 1 ocT rTO ri 0 


210 










215 










Ala Leu 


Lvs 


Asp 


Phe 


1 pu 


Php 


Gin 

U II 1 


Ala 

M 1 u 


1 V/C 


iyr bin lie Arg iyr Arg 


225 




















Val Glu 


nop 




Val 

VG 1 


Acn 

nop 


Trn 

i rp 


1 vc 

Lys 


Val 
Vd 1 


Va 1 


Asp Asp va i ber Asn bin 








245 












COD 


Thr Ser 


Cvs 


Arg 


1 pi j 


Ala 


u \y 


t PI 1 


1 v<; 
Lyb 


Pro 
ri U 


Gl\/ Thr Wal T\/r Dha 

uiy inr vai iyr rne vai 






260 










265 




910 


Gin Val 


Arq 


Cys 


Asn 


Pro 


Phe 


Glv 


T1p 

1 1 C 


Tvr 
i y\ 


Glv 9or 1 vc 1 v/c AT a f^lv/ 
uiy ocr Lys Lyb Mia biy 




275 










280 






LUJ 


lie Trp 


Ser 


Glu 


TrD 


Ser 


His 


Pro 


Thr 


Ala 


Als ^pr Thr Prn Arn Cor 
Mia oci Mir n u mi y oci 


290 










295 








100 


Glu Arg 


Pro 


Gly 


Pro 


Glv 

U 1 J 


Glv 


Glv 

a \y 


Ala 




Gin Prn Arn Glv £1 v 
uiu riu mi y uiy uiy biu 


305 








310 










31 ^ ^90 


Pro Ser 


Ser 


Gly 


Pro 


Val 


Ara 


"i y 


Glu 


t PI 1 


1 vc Gin Dha 1 on /^T v/ Trn 

Ljb bin rne Leu uiy irp 








325 










330 


335 


Leu Lys 


Lys 


His 


Ala 


Tyr 


Cys 


Ser 


Asn 


Leu 


Ser Phe Arg Leu Tyr Asp 






340 










345 




350 


Gin Trp 


Arg 


Ala 


Trp 


Met 


Gin 


Lys 


Ser 


His 


Lys Thr Arg Asn 61 n Asp 




355 










360 






365 . 


Glu Gly 


He 


Leu 


Pro 


Ser 


Gly 


Arg 


Arg 


Gly 


Thr Ala Arg Gly Pro Ala 


370 










375 








380 



Arg 



385 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Ala 


His 


Thr 


Ala 


Val 


Tip 

1 1 c 


OCI 


Prn 
ri u 


Gin 


Acn Prn 
Mojj ri u 


Thr 1 pi i 

III) LCU 


1 pi i T 1 o Gl v 

lcu iic o iy 


1 








5 
















Sep 


Ser 


1 pu 

l_CLf 


I pu 


Ala 


Thr 

nil 


TVS 


Cap 
ocl 


va I 


Hie Glv 
n i o u iy 


Acn Drn 
Mop ri U 


Prn Ala 
rl U u iy Aid 








20 
















^n 


Thr 
1 1 1 1 


Ala 

r\ 1 □ 


Glu 




1 PI 1 

Leu 


Tvr 

iyr 


irp 


Thr 

i nr 


LcU 


Acn 1^1 w 

Asn biy 


Arg Arg 


Leu rro rTO 






35 










AO 






A^ 

HO 




Git j 

u I U 


1 PI 1 
LcU 


Cop 

ocr 


Arn 
Ml y 


Val 
va l 


LcU 


Msn 


AT a 

Aia 


oer 


inr Leu 


Ala Leu 


Aia Leu Ala 




^n 

OU 










DO 












Acn 


LcU 


Acn 
Mori 


Gl v 




Arg 


u i n 


Arg 


Ser 


b iy ASp 


Asn Leu 


val Lys His 












7n 








/D 




on 


Ala 

M 1 CI 


Arn 
mi y 


Acn 
Mop 


Gl v 

u iy 


Cor 
ocr 


1 1 c 


LcU 


Ala 

Aia 


u iy 


Ser Cys 


Leu iyr 


va 1 biy Leu 










OO 










90 






Prn 


Prn 


Glu 

U 1 u 


1 vs 
Lyo 


Prn 
ri U 


Val 


Moil 


J. 1c 


ocr 


tys i rp 


oer Lys 


Asn net Lys 








inn 










JLUO 






110 


Asn 


1 Pll 


Thr 


fvc 

L^O 


Arn 
mi y 


Trn 

irp 


Thr 
1 Ml 


Prn 
ri U 


b iy 


Ala Hie 
Aid nib 


b iy b 1 u 


i nr rne Leu 






115 










i?n 






i ?r 




His 


Thr 


no 1 1 


Tvr 


^pr 

OCI 


I pi i 

LCU 


i VC 

Lyo 


Tvr 

iyr 


I vc 

Lys 


1 oil Arn 
LcU Mf y 


irp iyr 


niv nin Acn 

biy bin Asp 




130 










13S 








140 


Asn 


Thr 


Cvs 

ujr o 


Glu 


Glu 


Tvr 
iyi 


Hi s 
n i o 


Thr 
1 1 u 


Val 
va i 


Glv Prn 
u iy r [ U 


c Cap 
n 1 o OCI 


fvc Mi c T1q 

uyo nib lie 


145 










150 














Pro 


Lvs 


Asp 


Leu 


Ala 


1 PU 


Phe 


Thr 


Prn 

r i u 


Tvr Glu 


T1p Trn 


V^l Gin Ala 

VOI U III Mlu 










165 










170 




17^ 


Thr 


Asn 


Am 


Leu 


Glv 


Ser 


Ala 


Ara 


Spr 


Asd Val 

/AO Lf VU 1 


1 pu Thr 

LCU 1 1 II 


\ pi i Acn T1p 

LCU r\OU 1 1 C 








180 










185 




190 


Leu 


Asp 


Val 


Val 


Thr 


Thr 


Asn 

no lv 


Pro 


Prn 


Pro Glu 


Val His 

v a i n i o 


Val ^\Pr Arn 
va i oci mi y 






195 










200 








Val 


61 y 


Glv 


Leu 


Glu 


no yj 


Gin 

U 1 1 J 


1 Pll 


^pr 

OCI 


Val Ara 

v a i i\\ y 


Trn Val 

lip v a i 


*\Pr Prn Prn 
oci n u r » u 




210 










215 








220 




Ala 


Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin 


Ala 


Lys Tyr 


Gin He 


Arg Tyr Arg 


225 










230 








235 




240 


Val 


Glu 


Asp 


Ser 


Val 


Asp 


Trp 


Lys 


Val 


Val Asp 


Asp Val 


Ser Asn Gin 










245 










250 




255 


Thr 


Ser 


Cys 


Arg 


Leu 


Ala 


Gly 


Leu 


Lys 


Pro Gly 


Thr Val 


Tyr Phe Val 








260 










265 






270 


Gin 


Val 


Arg 


Cys 


Asn 


Pro 


Phe 


Gly 


He 


Tyr Gly 


Ser Lys 


Lys Ala Gly 






275 










280 






285 




He 


Trp 


Ser 


Glu 


Trp 


Ser 


His 


Pro 


Thr 


Ala Ala 


Ser Thr 


Pro Arg 




290 










295 








300 



(2) INFORMATION FOR SEQ ID N0:22: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Ala 
M I a 


Mi c 
n i o 


Thr 

i nr 


Al a 
Mid 


Vd I 


i ie 


Ser 


pro 


b m 


Asp Pro 


Tit** 

i nr 


Leu Leu lie Gly 


1 
1 








C 
0 














15 


OCT 




LcU 


Leu 


Mia 


i nr 


Cue 

tys 


Ser 


va i 


HIS biy 


Asp 


Pro Pro Gly Ala 








?n 

c.\) 
















oil 


Thr 


Ala 


Gin 


u ty 


Leu 


lyr 


Trp 


I nr 


Leu 


Asn biy 


Arg 


Arg Leu Pro Pro 






oo 


















/it 


Glu 


1 pi i 


Cor 
JCl 


Arn 
Mi y 


Val 
Vd 1 


LcU 


Msn 


Al a 

m i a 


Ser 


mr Leu 


AT a 

ai a 


Leu Ala Leu Ala 




50 










DD 








fin 




Acn 

no 1 1 


1 pi i 


A^n 




Cop 

OCT 


Arn 
Mi y 


Gin 


Mrg 


C O f! 

JCl 


uiy Msp 


Asn 


Leu va i tys His 


65 










70 
f u 








7^ 
/o 




on 


Ala 


Am 


Acn 


Gl v 


Cor* 


Tip 


i oi i 

LcU 


Al a 
M 1 a 


b ly 


Ser Cys 


Leu 


iyr va i biy Leu 










85 
















Pro 


Pro 


Glu 


Lvs 


Pro 


Val 




TIP 

1 1 c 


Cpp 
Oct 


C uc Trn 


OCI 


Ljo MM I rlcL LjS 








100 










105 






no 


Asp 


Leu 


Thr 


Cys 


Ara 


Trn 
1 1 p 


Thr 


Pro 


Gl v 


Ala His 

r\ 1 O 1113 


Gl v 
u ly 


Gl n Thr Php 1 pi i 

U IU 1 M 1 r 1 ic LcU 






115 










120 








125 


His 


Thr 


Asn 


Tyr 


Ser 


Leu 


Lys 


Tyr 


Lys 


Leu Arg 


Trp 


Tyr Gly Gin Asp 




130 










135 








140 


Asn 


Thr 


Cys 


Glu 


Glu 


Tyr 


His 


Thr 


Val 


Gly Pro 


His 


Ser Cys His He 


145 










150 








155 




160 


Pro 


Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Thr 


Pro 


Tyr Glu 


He 


Trp Val Glu Ala 










165 










170 




175 


Thr 


Asn 


Arg 


Leu 


Gly 


Ser 


Ala 


Arg 


Ser 


Asp Val 


Leu 


Thr Leu Asp He 








180 










185 






190 


Leu 


Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro 


Pro 


Pro Asp 


Val 


His Val Ser Arg 






195 










200 








205 


Val 


Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


Leu 


Ser 


Val Arg 


Trp 


Val Ser Pro Pro 




210 










215 








220 




Ala 


Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin 


Ala 


Lys Tyr 


Gin 


He Arg Tyr Arg 


225 










230 








235 




240 


Val 


Glu 


Asp 


Ser 


Val 


Asp 


Trp 


Lys 


Val 


Val Asp 


Asp 


Val Ser Asn Gin 










245 










250 




255 


Thr 


Ser 


Cys 


Arg 


Leu 


Ala 


Gly 


Leu 


Lys 


Pro Gly 


Thr 


Val Tyr Phe Val 



260 265 270 
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Gin Val 


Arg 


Cys 


Asn 


Pro 


Phe 


Gly He 


Tyr Gly Ser Lys Lys 


Ala Gly 




275 










280 


285 




He Trp 


Ser 


Glu 


Trp 


Ser 


His 


Pro Thr 


Ala Ala Ser Thr Pro 


Arg Ser 


290 










295 




300 


Glu Arg 


Pro 


Gly 


Pro 


Gly 


Gly 


Gly Ala 


Cvs Glu Pro Ara Glv 


Glv Glu 


305 








310 






315 


320 


Pro Ser 


Ser 


Gly 


Pro 


Val 


Arg 


Ara Glu 

ni y u 1 u 


1 pu 1 Gin Php 1 pi j 










325 








330 


335 


Leu Lys 


Lys 


His 


Ala 


Tyr 


Cys 


Ser Asn 


Leu Ser Phe Arg Leu 


Tyr Asp 






340 








345 


350 


Gin Trp 


Arg 


Ala 


Trp 


Met 


Gin 


Lys Ser 


His Lys Thr Arg Asn 


Gin His 




355 










350 


365 




Arg Thr 


Arg 


Gly 


Ser 


Cys 


Pro 


Arg Ala 


Asp Gly Ala Arg Arg 


Glu Val 


370 










375 




380 




Leu Pro 


Asp 


Lys 


Leu 













385 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Ala His Thr Ala Val lie Ser Pro Gin Asp Pro Thr Leu Leu He Gly 
15 10 15 

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala 

20 25 30 

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro 

35 40 45 

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala 

50 55 60 

Asn Leu Asn Gly Ser Arg Gin Arg Ser Gly Asp Asn Leu Val Cys His 
65 70 75 80 

Ala Arg Asp Gly Ser He Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu 

85 90 95 

Pro Pro Glu Lys Pro Val Asn He Ser Cys Trp Ser Lys Asn Met Lys 

100 105 110 

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu 
115 120 125 
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His 


Thr 


Asn 


Tvr 


Ser 


Leu 


Lvs 


Tvr 


Lvs 


1 pu Ara Trn 
lcu mi y 1 1 p 


Tvr Hlv Gin Asn 

I Jr I vj I Jr ulll Mo p 




130 










135 






140 




Asn 


Thr 


Cys 


Glu 


Glu 


Tyr 


His 


Thr 


Val 

V U 1 


Glv Pro His 

VJ 1 Jr ri U III J 


^pr fvs His Tip 

•u C 1 \-» Jf O 11 1 O lit 


145 










150 








155 


160 


Pro 


Lys 


Asp 


Leu 


Ala 


1 pij 


Phe 


Thr 
1 1 1 1 


Prn 


Tvr Gin Tip 

1 Jr 1 \J 1 u lie 


Trn Val Glu Ala 

lip vol UIU nlu 










165 










170 


17S 


Thr 


Asn 


Ara 

ni y 


Leu 


Glv 


Spr 


Ala 


Am 

mi y 


xPP 


Asn Val 1 pu 
Mop va i lcu 


Thr 1 pii Acn Tlo 

1 1 II LCU Mop 1 1 c 








180 










185 




190 


Leu 


Asp 


Val 


Val 


Thr 


Thr 


Asn 

nJ p 


Prn 

r i \j 


Prn 


Prn Asn Val 

r l U Mo p Val 


Hi c \/al Spr Am 
n i o v a i oci mi y 






195 










200 






205 


Val 


61 y 


Glv 


Leu 


Glu 


Asn 


Gin 

U 1 II 


1 Pii 

1— C» 14 


Spr 

JCI 


Val Ara Trn 

v u i r\\ y i i u 


V/al ^pr Prn Prn 
va i jci n u r i u 




210 










215 






220 




Ala 


Leu 


Lvs 


Asd 


Phe 

1 1 IG 


1 pij 


Php 


Gin 

vj 1 1 1 


Ala 

M 1 a 


1 vq Tvr fnl n 

Ljo Ijr 1 Ulll 


Tip Arn Tv/r Ar*n 
lie ni y i y i mi g 


225 










230 










240 


Val 


Glu 

VJ 1 \A 


Asn 


*\Pr 
%}CI 


Val 


Acn 
Mop 


irp 


1 vc 

Lys 


va I 


Acrv Acn 
Va 1 Mop Mop 


va i oer Msn o i n 










245 










250 


255 


Thr 


Ser 


Cys 


Arg 


Leu 


Ala 


Gly 


Leu 


Lys 


Pro Gly Thr 


Val Tyr Phe Val 








260 










265 




270 


Gin 


Val 


Arg 


Cys 


Asn 


Pro 


Phe 


Gly 


He 


Tyr Gly Ser 


Lys Lys Ala Gly 






275 










280 






285 


He 


Trp 


Ser 


Glu 


Trp 


Ser 


His 


Pro 


Thr 


Ala Ala Ser 


Thr Pro Arg 




290 










295 






300 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 





(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


N0:24: 




Ala 


His Thr Ala Val 


He Ser Pro 


Gin Asp 


Pro Thr 


Leu Leu He Gly 


1 


5 




10 




15 


Ser 


Ser Leu Leu Ala 


Thr Cys Ser 


Val His 


Gly Asp 


Pro Pro Gly Ala 




20 




25 




30 


Thr 


Ala Glu Gly Leu 


Tyr Trp Thr 


Leu Asn 


Gly Arg 


Arg Leu Pro Pro 




35 


40 






45 


Glu 


Leu Ser Arg Val 


Leu Asn Ala 


Ser Thr 


Leu Ala 


Leu Ala Leu Ala 




50 


55 




60 




Asn 


Leu Asn Gly Ser 


Arg Gin Arg 


Ser Gly 


Asp Asn 


Leu Val Cys His 


65 




70 




75 


80 
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Ala Arg 


Asp 


Gly 


Ser 


He 


Leu 


Ala 


Gly 


Ser Cys Leu Tyr Val 


Gly Leu 








85 










90 


95 


Pro Pro 


Glu 


Lys 


Pro 


Val 


Asn 


He 


Ser 


Cys Trp Ser Lys Asn 


Met Lys 






100 










105 


110 


Asp Leu 


Thr 


Cys 


Arg 


Trp 


Thr 


Pro 


Gly 


Ser His Gly Glu Thr 


Phe Leu 




115 










120 




125 




His Thr 


Asn 


Tyr 


Ser 


Leu 


Lys 


Tyr 


Lys 

■J 


Leu Arq Trp Tyr Glv 


Gin Asd 


130 










135 






140 


Asn Thr 


Cys 


Glu 


Glu 


Tyr 


His 


Thr 


Val 


Gly Pro His Ser Cvs 


His He 


145 








150 








155 


160 


Pro Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Thr 


Pro 


Tvr Glu lie Tro Val 


Glu Ala 








165 










170 


175 


Thr Asn 


Arg 


Leu 


Gly 


Ser 


Ala 


Arq 


Ser 


Asp Val Leu Thr Leu 


A^n Tip 






180 










185 


190 


Leu Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro 


Pro 


Pro Asd Val His Val 


Spr Ara 




195 










200 




205 


Val Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


Leu 


Ser 


Val Ara Tro Val Ser 


Prn Prn 


210 










215 






220 




Ala Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin 


Ala 


Lvs Tvr Gin He Ara 

_/ ' J > will X 1 \* l\\ M 


Tvr Arn 


225 








230 








235 


240 


Val Glu 


Asp 


Ser 


Val 


Asd 


Trp 


Lys 


Val 


Val Asd Asd Val Spr 


Aqn Gin 

noi i u III 








245 










250 


255 


Thr Ser 


Cys 


Arg 


Leu 


Ala 


Gly 


Leu 


Lys 


Pro Gly Thr Val Tyr 


Phe Val 






260 










265 


270 




Gin Val 


Arg 


Cys 


Asn 


Pro 


Phe 


Gly 


He 


Tyr Gly Ser Lys Lys 


Ala Gly 




275 










280 




285 


He Trp 


Ser 


Glu 


Trp 


Ser 


His 


Pro 


Thr 


Ala Ala Ser Thr Pro 


Arg Ser 


290 










295 






300 


Glu Arg 


Pro 


Gly 


Pro 


Gly 


Gly 


Gly 


Ala 


Cys Glu Pro Arg Gly 


Gly Glu 


305 








310 








315 


320 


Pro Ser 


Ser 


Gly 


Pro 


Val 


Arq 


Arq 


Glu 


Leu Lvs Gin Phe Leu 


Glv Trn 








325 










330 


335 


Leu Lys 


Lys 


His 


Ala 


Tyr 


Cys 


Ser 


Asn 


Leu Ser Phe Arg Leu 


Tyr Asp 






340 










345 


350 


Gin Trp 


Arg 


Ala 


Trp 


Met 


Gin 


Lys 


Ser 


His Lys Thr Arg Asn 


Gin His 




355 










360 




365 




Arg Thr 


Arg 


Gly 


Ser 


Cys 


Pro 


Arg 


Ala 


Asp Gly Ala Arg Arg 


Glu Val 


370 










375 






380 




Leu Pro 


Asp 


Lys 


Leu 















385 



(2) INFORMATION FOR SEQ ID N0:25: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 389 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:25: 



Ala 


His 

1 1 1 o 


Thr 
1 1 1 1 


Ala 


Val 

V U 1 


Tip 

1 1 c 


JCI 


Prn 
ri u 


Gin 

DIM 


Acn Drn 
nop ri U 


Thr* 1 qi i 

inr LcU 


Leu lie b ly 


1 








5 










10 




15 


<spr 


^pr 

OCl 


LcU 


1 PI l 


Ala 
M 1 d 


I fir 


f vc 


Coo 

oer 


Vd 1 


US c £lw 

nl S b ly 


ASp rTO 


rro biy Ala 








90 










CO 






oU 


Thr 


Ala 


Gin 


Gl V 




Tv/r* 

tyr 


irp 


i nr 


Leu 


Asn u ly 


Arg Arg 


Leu Pro Pro 
















HU 






45 




Glu 


1 PI 1 


Car 


Am 
mi y 


vd 1 


LcU 


Acn 


Al a 
M 1 d 


OCI 


Tnr* 1 ni i 

i nr LcU 


Al a Leu 


Aia Leu Ala 




50 


















fin 




noi i 


1 pii 


Asn 


Gl V 


Spr 

OCI 


Am 
ni y 


Gin 


Am 
Mi y 


Cop 


u 1 y Mi p 


asm Leu 


Pwc Uic 

va i tys HIS 


65 










70 








/ J 




80 


Ala 


Arg 


Asp 


Gl V 
u i y 


^pr 


He 

A 1 C 


1 PM 


Ala 

r\ 1 Q 


Gl V 

o i y 


OCT Ljo 


LcU IjfT 


\/ai m v i oi i 
vd i u ly Leu 










85 










90 






Pro 


Pro 


Glu 


Lvs 


Pro 


V U 1 


Asn 


Tip 

1 1 c 


<\pr 

OCI 


P \/q Trn 
UJro 1 1 p 


Cor* 1 \jc 
ocl Ljo 


Acn Mof 1 v/r 

Asn net Lys 








100 










105 






110 


Asp 


Leu 


Thr 


Cvs 


Ara 

r\i y 


Trn 


Thr 


Prn 


Gl v 

U 1 Jf 


Ala His 
n I d n I o 


Glv Gill 

o \y \j\u 


mr rne Leu 






115 










120 






12S 




His 


Thr 


Asn 


Tvr 


Ser 


Leu 


Lvs 


Tvr 


Lvs 


1 pi J Arn 


Trn Tvr 


Gl v Gin Acn 
U i jr u 1 1 1 Mo p 




130 










135 








140 


Asn 


Thr 


Cys 


Glu 


Asd 


Tyr 


His 


Thr 


Val 


Gly Pro 


His Ser 


Cvs His Tip 

V* Jf O 1 1 1 o lie 


145 










150 








155 




160 


Pro 


Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Thr 


Pro 


Tyr Glu 


Tip Trn 

1IC 1 1 IJ 


Val Glu Ala 

V □ 1 U IU r\ 1 Q 










165 










170 


175 


Thr 


Asn 


Arg 


Leu 


Gly 


Ser 


Ala 


Arg 


Ser 


Asp Val 


Leu Thr 


Leu Asp He 








180 










185 






190 


Leu 


Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro 


Pro 


Pro Asp 


Val His 


Val Ser Arg 






195 










200 






205 


Val 


Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


Leu 


Ser 


Val Arg 


Trp Val 


Ser Pro Pro 




210 










215 








220 




Ala 


Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin 


Ala 


Lys Tyr 


Gin He 


Arg Tyr Arg 


225 










230 








235 




240 


Val 


Glu 


Asp 


Ser 


Val 


Asp 


Trp 


Lys 


Val 


Val Asp 


Asp Val 


Ser Asn Gin 










245 










250 




255 


Thr 


Ser 


Cys 


Arg 


Leu 


Ala 


Gly 


Leu 


Lys 


Pro Gly 


Thr Val 


Tyr Phe Val 








260 










265 






270 


Gin 


Val 


Arg 


Cys 


Asn 


Pro 


Phe 


Gly 


He 


Tyr Gly 


Ser Lys 


Lys Ala Gly 






275 










280 






285 
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lie Trp 


Ser 


Glu 




^pr 


n i o 


Prn 
ri U 


290 










295 




Glu Arg 


Pro 


Gly 


Pro 


Glv 


Glv 


Glv 


305 








310 






Pro Ser 


Ser 


Glv 


Prn 


Val 


Arn: 
mi y 


Arn 
mi y 


1 01 1 1 we 




n i b 
340 


Al a 
M 1 a 


lyr 


tys 


Ser 


Gin Trp 


Arg 
355 


Ala 


Trp 


Met 


Gin 


Lys 
360 


Arg Thr 


Arg 


Gly 


Ser 


Cys 


Pro 


Arg 


370 










375 




Leu Pro 


Asp 


Lys 


Leu 








385 















Thr 


Ala Al a ^tor* 
Mid Mid jci 


inr rro Arg oer 






Ala 
M 1 a 


Cue nin Prn 
i>yi> u 1 u r i u 


Arn £1 v (Z*\\r Pin 

™y b iy b ty b IU 




315 




b 1 U 


1 qi i 1 \tc Pin 

Leu Lys b I n 


rne Leu biy irp 




330 


335 


Asn 


Leu Ser Phe 


Arg Leu Tyr Asp 


345 




350 


Ser 


His Lys Thr 


Arg Asn Gin His 






365 


Ala 


Asp Gly Ala 


Arg Arg Glu Val 




380 





(2) INFORMATION FOR SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

Ala His Thr Ala Val He Ser Pro Gin Asp Pro Thr Leu Leu He Gly 
15 10 15 

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala 

20 25 30 

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro 

35 40 45 

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala 

50 55 60 

Asn Leu Asn Gly Ser Arg Gin Arg Ser Gly Asp Asn Leu Val Cys His 
65 70 75 80 

Ala Arg Asp Gly Ser lie Leu Ala Gly Ser Cys Leu Tyr Val Gly Leu 

85 90 95 

Pro Pro Glu Lys Pro Val Asn lie Ser Cys Trp Ser Lys Asn Met Lys 

100 105 110 

Asp Leu Thr Cys Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu 

115 120 125 

His Thr Asn Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gin Asp 
130 135 140 
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Asn 


Thr 


Cys 


Glu 


Glu 


Tvr 
iyi 


Hie 


Thr 
1 1 ii 


Val 
vd I 


Gl \t Prn Hie Cor C\ta Hi c 

uiy ri u nib ocr uyb nis 


np 

i Ic 


145 










150 








1 nn 


1 fin 


Pro 


Lvs 


Asp 


Lpu 


Thr 
tin 


LCU 


Php 
n ic 


Thr 

Mil 


Prn 
rf u 


Tvr Gin T1p Trn Val Gin 
iyi uiu lie irp vol uiu 


Ala 
A 1 d 










165 










170 17R 




Thr 


Asn 


Arg 


Leu 


Gl v 


^pr 

OCI 


Ala 


Arn 
Mr y 


Cop 


Acn \/al I pi i Thr 1 pii Acn 
Mop Vd 1 LcU 1 111 LcU Mbp 


Tip 
1 Ic 








180 










185 


1Q0 




Leu 


Asp 


Val 


Val 


Thr 


Thr 

Mil 


Acn 

no p 


Prn 


Prn 

rf u 


Prn Acn Wi c Ual Cor 
ri u Mop vd I nib Vd l jci 


Arg 






195 










200 




90^ 


Val 


Gl v 

u i y 


Gl v 


1 Pll 


Glu 

VJ I u 


Acn 
nop 


Gin 

VJ 1 l 1 


1 PI 1 
LCU 


Cop 
OCI 


\/a 1 Arn Trn Wal Cor Drn 
va i Mi y lip val jcr rTO 


rrO 




210 










21^ 






990 




Ala 


1 PI J 




Acn 

r\op 


Php 
r i ic 


1 Pi i 
Leu 


Php 
r lie 


nin 

u 1 U 


AT a 
M 1 a 


Lys lyr □ in lie Arg lyr 


Arg 


99^ 




















oa n 


Val 

V a 1 


Glu 

u i u 


Acn 

r\op 


Cop 
OCI 


Val 
va i 


Acn 
Mbp 


Trn 

irp 


1 vc 

Lys 


Va 1 


va i Msp Asp va i oer Asn 


bin 










245 










9^0 oc;c; 




Thr 




L>jro 


Am 

r\\ y 


1 pi i 


Ala 


Gl v 


1 PI 1 
LcU 


Lys 


Prn v Thr Twn Dho 

ft u uiy i nr vai lyr rne 


va i 








260 










2fiS 

L.UJ 


CI u 




Gin 


Val 


Aro 

m y 


Tvc 


Acn 

r^o 1 1 


Prn 
r i kj 


Php 

r 1 IC 


Gl v 
u iy 


T1p 
1 I c 


T\/r CI w Cap 1 \/c 1 \/c Al n 

iyr uiy ocr Lys Lys Ala 


b iy 






275 










PRO 




coo 


Tip 

1 1 c 


i r p 


OCI 


Gin 


Trn 

irp 


Cop 
JCI 


Mi c 


Prn 
ri U 


Thr 


Ala Aia oer inr rro Arg 


Ser 




290 










9Q^ 






oUU 




Glu 


Arn 
mi y 


Prn 
rr u 


u iy 


rf U 


Gl V 

u iy 


i3 iy 


u iy 


Al a 
M I a 


Lys uiu rro Arg biy biy 


blU 


305 










^10 








0X3 


T*?n 


Prn 
n u 


jci 


OCI 


Gl v 


Prn 


Vd 1 


Arn 


A rn 

Arg 


Li 1 U 


Leu Lys bin rne Leu biy 


Trp 










325 










^n 


Leu 


Lys 


Lys 


His 


Ala 


Tyr 


Cys 


Ser 


Asn 


Leu Ser Phe Arg Leu Tyr 


Asp 








340 










345 


350 


Gin 


Trp 


Arg 


Ala 


Trp 


Met 


Gin 


Lys 


Ser 


His Lys Thr Arg Asn Gin 


His 






355 










360 




365 




Arg 


Thr 


Arg 


Gly 


Ser 


Cys 


Pro 


Arg 


Ala 


Asp Gly Ala Arg Arg Glu 


Val 




370 










375 






380 




Leu 


Pro 


Asp 


Lys 


Leu 















385 



(2) INFORMATION FOR SEQ ID N0:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
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Ala His 


Thr 


Ala 


Val 

V U 1 


He 

J. I 


Spr 


Prn 
ri u 


1 






5 








Ser Ser 


Leu 

l_ t m 


L PU 


Ala 


Thr 


Pvs 


Sop 






20 










Thr Ala 


Glu 


Glv 


1 Pi I 


Tvr 

iyr 


Trn 

irp 


Thr 
1 in 




35 










AO 


Glu Leu 


S°r 


Aro 

ni y 


Val 

V U 1 


l pi i 

LCU 


Acn 


Ala 


50 










55 




Asn Leu 


Asn 


Glv 


Spr 


Ara 


Gin 


Arn 
mi y 


65 








70 






Al a Ara 


Asn 


Glv 


OCI 


Tie 

1 1 c 


1 PI 1 

LcU 


Ala 
M I a 








oo 








Prn Prn 


Gill 


Lys 


rr U 


Val 
va 1 


Msn 


i ie 






inn 












Thr 


C vc 
W>> 


Arn 


irp 


Thr 

i nr 


Drn 

rTO 




IIS 










1 90 
lev 


His Thr 

1 1 1 O I 1 II 


Acn 
no 1 1 


Tv/r 

iyr 


Cap 
JCI 


LcU 


Lys 


Tyr 


130 










135 




Asn Thr 


P VS 


U 1 U 


Glu 

U 1 u 


Tvr 

iyr 


Hi q 
n t o 


Thr 


145 














Pro Lys 


Asn 

now 


1 Pli 


Ala 

r\ I a 


1 PI 1 
LCU 


Php 
ri ic 


Thr 
I nr 
















Thr Asn 


Am 


1 PI I 


u i y 


Cor 
JCI 


Cor 1 


arg 






180 










Lpu Asn 

no 


Val 

V CI 1 


Val 

v u i 


Thr 

III! 


Thr 
nil 


Acn 
Mop 


Prn 




195 








200 


Val Gly 


Glv 


Leu 


Glu 


Asn 


Gin 

LI 1 1 1 


1 PI 1 

LcU 


210 










215 




Ala Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin 

VJ 1 1 1 


225 








230 






Val Glu 


Asp 


Ser 


Val 


Asn 


Trn 
i r \> 


1 VQ 

Lyo 








245 








Thr Ser 


Cvs 
^ y -j 


Ara 

rw y 


1 pii 


Ala 


Glv 
u \y 


1 PI 1 
LCU 






260 










Gin Val 


Arg 


Cvs 


Asn 

no 1 1 


Prn 

r i vj 


Php 


Glv 

U Ijf 




275 










280 


He Trp 


Ser 


Glu 


Trp 


Ser 


His 


Pro 


290 










295 




Glu Arg 


Pro 


Gly 


Pro 


Gly 


Gly 


Gly 


305 








310 






Pro Ser 


Ser 


Gly 


Pro 


Val 


Arg 


Arg 








325 








Leu Lys 


Lys 


His 


Ala 


Tyr 


Cys 


Ser 



340 





94 




Gin 


Asp Pro 


Thr Leu Leu He Gly 




10 


15 


Val 


His Gly 


Asp Pro Pro Gly Ala 


25 




30 


Leu 


Asn Gly 


Arg Arg Leu Pro Pro 






45 


Ser 


Thr Leu 


Ala Leu Ala Leu Ala 






60 


Ser 


Gly Asp 


Asn Leu Val Cys His 




75 


80 


Gly 


Ser Cys 


Leu Tyr Val Gly Leu 




90 


95 


Ser 


Cys Trp 


Ser Lys Asn Met Lys 


105 




110 


Gly 


Ala His 


Gly Glu Thr Phe Leu 






125 


Lys 


Leu Arg 


Trp Tyr Gly Gin Asp 






140 


Val 


Gly Pro 


His Ser Cys His He 




155 


160 


Pro 


Tyr Glu 


He Trp Val Glu Ala 




170 


175 


Ser 


Asp Val 


Leu Thr Leu Asp He 


185 




190 


Pro 


Pro Asp 


Val His Val Ser Arg 






205 


Ser 


Val Arg 


Trp Val Ser Pro Pro 






220 


Ala 


Lys Tyr 


Gin He Arg Tyr Arg 




235 


240 


Val 


Val Asp 


Asp Val Ser Asn Gin 




250 


255 


Lys 


Pro Gly 


Thr Val Tyr Phe Val 


265 




270 


He 


Tyr Gly 


Ser Lys Lys Ala Gly 






285 


Thr 


Ala Ala 


Ser Thr Pro Arg Ser 






300 


Ala 


Cys Glu 


Pro Arg Gly Gly Glu 




315 


320 


Glu 


Leu Lys 


Gin Phe Leu Gly Trp 




330 


335 


Asn 


Leu Ser 


Phe Arg Leu Tyr Asp 


345 




350 
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95 



Gin Trp Arg Ala Trp Met Gin Lys Ser 

355 360 
Arg Thr Arg Gly Ser Cys Pro Arg Ala 

370 375 
Leu Pro Asp Lys Leu 
385 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:28: 



Ala 


His 


Thr 


Ala 


Val 


He 


Ser 


Pro 


Gin 


Asp 


Pro 


Thr 


Leu Leu He Gly 


1 








5 










10 






15 


Ser 


Ser 


Leu 


Leu 


Ala 


Thr 


Cys 


Ser 


Val 


His 


Gly 


Asp 


Pro Pro Gly Ala 








20 










25 








30 


Thr 


Ala 


Glu 


Gly 


Leu 


Tyr 


Trp 


Thr 


Leu 


Asn 


Gly 


Arg 


Arg Leu Pro Pro 






35 










40 










45 


Glu 


Leu 


Ser 


Arg 


Val 


Leu 


Asn 


Ala 


Ser 


Thr 


Leu 


Ala 


Leu Ala Leu Ala 




50 










55 










60 




Asn 


Leu 


Asn 


Gly 


Ser 


Arg 


Gin 


Arg 


Ser 


Gly 


Asp 


Asn 


Leu Val Cys His 


65 










70 










75 




80 


Ala 


Arg 


Asp 


Gly 


Ser 


He 


Leu 


Ala 


Gly 


Ser 


Cys 


Leu 


Tyr Val Gly Leu 










85 










90 






95 


Pro 


Pro 


Glu 


Lys 


Pro 


Val 


Asn 


He 


Ser 


Cys 


Trp 


Ser 


Lys Asn Met Lys 








100 










105 








110 


Asp 


Leu 


Thr 


Cys 


Arg 


Trp 


Thr 


Pro 


Gly 


Ala 


His 


Gly 


Glu Thr Phe Leu 






115 










120 










125 


His 


Thr 


Asn 


Tyr 


Ser 


Leu 


Lys 


Tyr 


Lys 


Leu 


Arg 


Trp 


Tyr Gly Gin Asp 




130 










135 










140 


Asn 


Thr 


Cys 


Glu 


Glu 


Tyr 


His 


Thr 


Val 


Gly 


Pro 


His 


Ser Cys His He 


145 










150 










155 




160 


Pro 


Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Thr 


Pro 


Tyr 


Glu 


He 


Trp Val Glu Ala 










165 










170 






175 


Thr 


Asn 


Arg 


Leu 


Gly 


Ser 


Ala 


Arg 


Ser 


Asp 


Val 


Leu 


Thr Leu Asp Val 








180 










185 








190 


Leu 


Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro 


Pro 


Pro 


Asp 


Val 


His Val Ser Arg 



195 200 205 



His Lys Thr Arg Asn Gin His 
365 

Asp Gly Ala Arg Arg Glu Val 
380 
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Val 


Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


Leu 


Ser 


Val Arc TrD 


Val Spr Pro Pro 




210 










215 






220 




Ala 


Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin 


Ala 


Lvs Tvr Gin 


Hp Ara Tvr Ara 


225 










230 








235 


240 


Val 


Glu 


Asp 


Ser 


Val 


Asp 


TrD 


Lvs 


Val 


Val Asd Asd 


Val Ser Asn Gin 

V U 1 Jtl /\0 1 1 \J 1 1 1 










245 










250 


255 


Thr 


Ser 


Cys 


ArQ 


Leu 


Ala 


Gly 


Leu 




Pro Glv Thr 


Val Tvr Php Val 

vui ijri r i ic vol 








260 










265 




270 


Gin 


Val 


Ara 


Cvs 


Asn 


Pro 


Phe 


Gl v 


He 

lie 


Tvr Glv ^pr 


1 v<; 1 \zc Al a Gl \/ 
LJro LJro Mid uljr 






275 










280 






28S 


lie 


TfD 


Ser 


Glu 


TrD 


Ser 


His 


Pro 


Thr 


Ala Ala ^pr 


Thr Prn Arn Qor 
1 1 n ri u Mi y oci 




290 










295 






300 


Glu 


Arg 


Pro 


Gl v 


Pro 


Gl v 


Gl v 


Gl v 


Ala 

r\ 1 Q 


Pv<; Gin Prn 


Arn Gl v Gl \/ Glu 

mi y u iy u iy o i u 


305 










310 








11 5 


O/lU 


Pro 


Spp 


Spp 


Gl v 

u ljf 


Prn 

r i u 


VQ 1 


Am 
mi y 


Arn 
Mi y 


Gill 


Ian 1 \/c filn 
LcU Ljo u 1 rl 


rne Leu biy irp 










325 










330 




Leu 


Lys 


Lys 


His 


Ala 


Tyr 


Cys 


Ser 


Asn 


Leu Ser Phe 


Arg Leu Tyr Asp 








340 










345 




350 


Gin 


Trp 


Arg 


Ala 


Trp 


Met 


Gin 


Lys 


Ser 


His Lys Thr 


Arg Asn Gin His 






355 










360 






365 


Arg 


Thr 


Arg 


Gly 


Ser 


Cys 


Pro 


Arg 


Ala 


Asp Gly Ala 


Arg Arg Glu Val 




370 










375 






380 




Leu 


Pro 


Asp 


Lys 


Leu 
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(2) INFORMATION FOR SEQ ID N0:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: 

Ala His Thr Ala Val He Ser Pro Gin Asp Pro Thr Leu Leu He Gly 
15 10 15 

Ser Ser Leu Leu Ala Thr Cys Ser Val His Gly Asp Pro Pro Gly Ala 

20 25 30 

Thr Ala Glu Gly Leu Tyr Trp Thr Leu Asn Gly Arg Arg Leu Pro Pro 

35 40 45 

Glu Leu Ser Arg Val Leu Asn Ala Ser Thr Leu Ala Leu Ala Leu Ala 
50 55 60 
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Asn 


Leu 


Asn 


Glv 


Ser 


Arg 


Gin 


Ara 

r\i y 


Ser 


Glv Asn Asri 


1 PU \/al rv/c Mi c 
LCU vd 1 Uyb n 1 b 


65 










70 








75 


80 


Ala 


Ara 


Asp 


Gly 


Ser 


He 


Lpu 


Ala 


Glv 

u ly 


Spr f vs 1 pij 

v Jr O LCU 


Tvr Val f!l v 1 on 

l Jrl Vd 1 u iy Leu 










85 










90 




Pro 


Pro 


Glu 


Lys 


Pro 


Val 


Asn 


He 


Ser 


fvs Trn Spr 


1 vs A^n Mpf 1 vc 

Ljr«j r\oll MCL LJro 








100 










105 




110 


Asp 


Leu 


Thr 


Cys 


Ara 

y 


TrD 


Thr 


Pro 


Glv 


Ala His Glv 


Glu Thr Php 1 pm 
uiu 1 1 ii r i ic LCU 






115 










120 






125 


His 


Thr 


Asn 


Tvr 


Ser 


Leu 


1 v^ 


Tvr 
iyt 


Lyb 


1 pm Arn Trn 


Tvr ftl v ftln Acn 
1 jr 1 u 1 Jr u 1 11 Mb p 




130 










135 








Asn 


Thr 


Cvs 


Glu 


Glu 

\J 1 U 


Tvr 
iyr 


His 

n i o 


Thr 
1 1 ii 


Val 

Vu 1 


Glv Prn Hie 
u i jr r i u nib 


Cop rue Ui c TTo 

oer tys m s lie 


145 










1 so 

±o\j 










lou 


Pro 


Lvs 


A^n 


1 PIJ 


Ala 


1 pi i 


Php 

rl IC 


Thr 

1 I IT 


Prn 
ri U 


Tvr Gin Tip 
I yr uiu lie 


Trn Wal r;i 1 1 Ala 

i rp va i uiu ai a 










165 










170 


I/O 


Thr 


Asn 


Arg 


Leu 


Glv 

U 1 Jf 


Spr 


Ala 


Aro 

r\l y 


Spr 
oci 


Acn Val 1 pm 

r\op Va 1 LCU 


Thr 1 on Acn T1o 
1 111 LCU Mi>p I 1 c 








180 










185 






Leu 


Asp 


He 


Val 


Thr 


Thr 


A<;n 


Pro 


Prn 
r 1 u 


Prn Acn V/al 

ri U rNo^J Va 1 


Mi c V/al Qok> A^n 

nib vd i ocT Mrg 






195 










200 

L.UU 






C\JQ 


Val 


Glv 

U 1 J 


Glv 


Leu 


Glu 


A^n 


Gin 

U Ml 


1 PI I 


OCI 


Wal Arn Trn 
va i mi y 1 1 |j 


Cor 1 Dro Dr*^\ 

vd I jci rrO rTO 




210 










21^ 










Ala 


Leu 


Ljro 


no u 


Php 

r i ic 


1 PI 1 


Php 
r i ic 


Gin 

U 1 1 1 


Ala 


1 vc T\/r m n 
l_yb iyr um 


1 1 e Mrg iyr Arg 


225 




















240 


Val 


Glu 




Spr 


Val 

v a i 


Acn 

no [J 


Trn 


1 vc. 
Lyb 


Vd l 


Acn Acn 
Vd 1 rtb|J rtbp 


Cap Acn PI v\ 

va i oer asm u I n 










245 










250 


coo 


Thr 


Ser 


Cys 
j 


Ara 


Leu 


Ala 


Glv 


1 PU 

I— ^ Li 


Lvs 


Prn Glv Thr 

r i (j vj I jr I J II 


V^l Tvr Php Val 
va i iyr rile Vd l 








260 










265 




270 


Gin 


Val 


Arq 


Cys 


Asn 


Pro 


Phe 


Gly 


He 


Tvr Glv Ser 


1 v<; 1 v<; Al a ft! v 

L Jr O L jr O r\ I Q U 1 y 






275 










280 






285 


He 


Trp 


Ser 


Glu 


TrD 


Ser 


His 


Pro 


Thr 


Ala Ala Spr 


Thr Prn Arn ^pr 

1 1 ii n u r\\ y OCI 




290 










295 






300 


Glu 


Arq 


Pro 


Gly 


Pro 


Gly 


Glv 


Glv 


Ala 


Cvs Glu Pro 

w jr o uiu I I vj 


Ara Gl v Rl v ftl n 
ni y uiy uiy o i u 


305 










310 








315 




Pro 


Ser 


Ser 


Gly 


Pro 


Val 


Arg 


Ara 

ni y 


Glu 

VJ 1 u 


1 pu 1 v; Gl n 

LCU LJr o u 1 1 1 


php 1 pi i Rl v Trn 
ri ic lcu □ j y up 










325 










330 


335 


Leu 


Lys 


Lys 


His 


Ala 


Tyr 


Cys 


Ser 


Asn 


Leu Ser Phe 


Arg Leu Tyr Asp 








340 










345 




350 


Gin 


Trp 


Arg 


Ala 


Trp 


Met 


Gin 


Lys 


Ser 


His Lys Thr 


Arg Asn Gin His 






355 










360 






365 


Arg 


Thr 


Arg 


Gly 


Ser 


Cys 


Pro 


Arg 


Ala 


Asp Gly Ala 


Arg Arg Glu Val 




370 










375 






380 




Leu 


Pro 


Asp 


Lys 


Leu 















385 



(2) INFORMATION FOR SEQ ID NO: 30: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 





(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID NO:30: 




Ala 


His 


Thr 


Ala 


Val 


He 


Ser 


Pro 


Gin 


Asp Pro 


Thr Leu 


Leu lie u iy 


1 








5 










10 






JCI 


Ser 


Leu 


Leu 


Ala 


Thr 


Cys 


Ser 


Val 


His Gly 


Asp Pro 


Pro bly Ala 








20 










25 








Thr 


Ala 


Glu 


Gly 


Leu 


Tyr 


Trp 


Thr 


Leu 


Asn Gly 


Arg Arg 


Lgu Pro Pro 






35 










40 






45 




Glu 


Leu 


Ser 


Arg 


Val 


Leu 


Asn 


Ala 


Ser 


Thr Leu 


Ala Leu 


A ia Leu Ala 




50 










55 








60 




Asn 


Leu 


Asn 


Gly 


Ser 


Arg 


Gin 


Arg 


Ser 


Gly Asp 


Asn Leu 


Vd i Lys Ml s 


65 










70 








75 




80 


Ala 


Arg 


Asp 


Gly 


Ser 


He 


Leu 


Ala 


Gly 


Ser Cys 


Leu Tyr 


Vd t b iy LGU 










85 










90 






Pro 


Pro 


Glu 


Lys 


Pro 


Val 


Asn 


He 


Ser 


Cys Trp 


Ser Lys 


Moll rlcL LyS 








100 










105 






11 O 

11U 


Asp 


Leu 


Thr 


Cys 


Arg 


Trp 


Thr 


Pro 


Gly 


Ala His 


Gly Glu 


llli rile LcU 






115 










120 






125 




His 


Thr 


Asn 


Tyr 


Ser 


Leu 


Lys 


Tyr 


Lys 


Leu Arg 


Trp Tyr 


U 1 y u Ml Mop 




130 










135 








140 




Asn 


Thr 


Cys 


Glu 


Glu 


Tyr 


His 


Thr 


Val 


Gly Pro 


His Ser 


fvq Hi<; Tip 
o jr o n i o lie. 


145 










150 








155 




160 


Pro 


Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Thr 


Pro 


Tyr Glu 


lie Trp 


Val Glu Ala 










165 










170 




175 


Thr 


Asn 


Arg 


Leu 


Gly 


Ser 


Ala 


Arg 


Ser 


Asp Val 


Leu Thr 


Leu Asp He 








180 










185 






190 


Leu 


Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro 


Pro 


Pro Asp 


Val His 


Val Ser Arg 






195 










200 






205 


Val 


Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


Leu 


Ser 


Val Arg 


Trp Val 


Ser Pro Pro 




210 










215 








220 




Ala 


Leu 


Lys 


Asp 


Phe 


Leu 


Phe 


Gin 


Ala 


Lys Tyr 


Gin He 


Arg Tyr Arg 


225 










230 








235 




240 


Val 


Glu 


Asp 


Ser 


Val 


Asp 


Trp 


Lys 


Val 


Val Asp 


Asp Val 


Ser Asn Gin 










245 










250 




255 


Thr 


Ser 


Cys 


Arg 


He 


Ala 


Gly 


Leu 


Lys 


Pro Gly 


Thr Val 


Tyr Phe Val 








260 










265 






270 
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Gin 


Val 


Arg Cys Asn 


Pro Phe Glv 


He 


Tyr Gly Ser 


Lvs Lvs Ala Glv 






275 


280 








285 


He 


Trp 


Ser Glu Trp 


Ser His Pro 


Thr 


Ala Ala Ser 


Thr Prn Arn Spp 




290 




295 






300 


Glu 


Arq 


Pro Gly Pro 


Glv Glv Glv 


Ala 


Cys Glu Pro 


Arn Glv Hlv Gin 


305 






310 






315 


3?0 


Pro 


Ser 


Spr Glv Prn 


Val Am Am 


Glu 


Leu Lys Gin 


riic Leu u ly irp 






325 






330 




OOD 


Leu 


Lys 


Lys His Ala 


Tyr Cys Ser 


Asn 


Leu Ser Phe 


Arg Leu Tyr Asp 






340 




345 






350 


Gin 


Trp 


Arg Ala Trp 


Met Gin Lys 


Ser 


His Lys Thr 


Arg Asn Gin His 






355 


360 








365 


Arg 


Thr 


Arg Gly Ser 


Cys Pro Arg 


Ala 


Asp Gly Ala 


Mry nrg u t u va I 




370 




375 






380 




l pij 


Pro 


Asp Lys Leu 












385 




















(2) INFORMATION FOR SEQ ID 


N0:31: 






(i) SEQUENCE CHARACTERISTICS: 












(A) LENGTH: 


389 amino acids 












(B) TYPE: amino acid 














(C) STRANDEDNESS: single 












(D) TOPOLOGY: linear 












(ii) MOLECULE 


TYPE; protein 










(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


N0:31: 




Ala 


His 


Thr Ala Val 


He Ser Pro 


Gin 


Asp 


Pro Thr 


1 pij 1 pij Hp Gl v 


1 




5 






10 




15 


Ser 


Ser 


Leu Leu Ala 


Thr Cys Ser 


Val 


His 


Gly Asp 


Pro Pro Glv Ala 






20 




25 






30 


Thr 


Ala 


Glu Gly Leu 


Tyr Trp Thr 


Leu 


Asn 


Gly Arg 


Ara Lpu Prn Prn 






35 


40 








45 


Glu 


Leu 


Ser Arg Val 


Leu Asn Ala 


Ser 


Thr 


Leu Ala 


1 pij Al a 1 pi i Al a 




50 




55 






60 




Asn 


Leu 


Asn Gly Ser 


Arg Gin Arg 


Ser 


Gly 


Asp Asn 


1 pij Val Pvc; Hi c 
LCU VCII vjro nlo 


65 






70 






75 


80 


Ala 


Arg 


Asp Gly Ser 


He Leu Ala 


Gly 


Ser 


Cys Leu 


Tyr Val Gly Leu 






85 






90 




95 


Pro 


Pro 


Glu Lys Pro 


Val Asn He 


Ser 


Cys 


Trp Ser 


Lys Asn Met Lys 






100 




105 






110 


Asp 


Leu 


Thr Cys Arg 


Trp Thr Pro 


Gly 


Ala 


His Gly 


Glu Thr Phe Leu 






115 


120 








125 
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His 


Thr 


Asn 


Tyr 


Ser 


Leu 


Lys 


Tyr 


Lys 


Leu Arg 


TrD Tyr Glv Gin Asd 

■ ■ J J villi rw L/ 




130 










135 








140 


Asn 


Thr 


Cys 


Glu 


Glu 


Tyr 


His 


Thr 


Val 


Gly Pro 


His Ser Cys His lie 


145 










150 








155 


160 


Pro 


Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Thr 


Pro 


Tyr Glu 


He TrD Val Glu Ala 










165 










170 


175 


Thr 


Asn 


Arg 


Leu 


Gly 


Ser 


Ala 


Ara 


Ser 


Asd Val 


Leu Thr Lpu Asn Tip 








180 










185 




190 


Leu 


Asp 


Val 


Val 


Thr 


Thr 


Asp 


Pro 


Pro 


Pro Asp 


Val His Val Ser Ara 






195 










200 






205 


Val 


Gly 


Gly 


Leu 


Glu 


Asp 


Gin 


Leu 


Ser 


Val Arg 


Trn Val Spr Pro Prn 




210 










215 








220 


Ala 


Leu 


Lys 


Asd 


Phe 


Leu 


Phe 


Gin 


Ala 


Lv*5 Tvr 


Gin Tip Arn Tvr Arn 
u in iic r\t y i y f /if y 


225 










230 








235 




Val 


Glu 


Asp 


Ser 


Val 


Asp 


TrD 


Lvs 


Val 


Val Asd 


Asn Val Spr Asn Gin 

no ^/ vui jci r\o 1 1 u I 1 1 










245 










250 




Thr 


Ser 


Cys 


Arc 


Leu 


He 


Glv 


Leu 




Pro Glv 


Thr Val Tvr Php Val 
i i ii va i i j \ r lie va i 








260 










265 




?70 


Gin 


Val 


Arq 


Cys 


Asn 


Pro 


Phe 


Glv 


He 


Tvr Glv 


^pr 1 vs 1 Ala Glv 






275 










280 






285 


He 


Trp 


Ser 


Glu 


TrD 


Ser 


His 


Pro 


Thr 


Ala Ala 


Spr Thr Prn Arn 9pr 




290 










295 








300 


61 u 


Arg 


Pro 


Gly 


Pro 


Gly 


Gly 


Gly 


Ala 


Cys Glu 


Pro Ara Glv Glv Glu 


305 










310 








315 


320 


Pro 


Ser 


Ser 


Gly 


Pro 


Val 


Ara 


Ara 


Glu 


Leu Lys 


Gin Phe Lpu Glv Trn 










325 










330 


335 


Leu 


Lys 


Lys 


His 


Ala 


Tyr 


Cys 


Ser 


Asn 


Leu Ser 


Phe Arg Leu Tyr Asp 








340 










345 




350 


Gin 


Trp 


Arg 


Ala 


Trp 


Met 


Gin 


Lys 


Ser 


His Lys 


Thr Arg Asn Gin His 






355 










360 






365 


Arg 


Thr 


Arg 


Gly 


Ser 


Cys 


Pro 


Arg 


Ala 


Asp Gly 


Ala Arg Arg Glu Val 




370 










375 








380 


Leu 


Pro 


Asp 


Lys 


Leu 















385 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:32: 

Asn Ser Ala Arg 61 y Ala Cys Val Pro Arg Arg Ala Pro Pro Pro Pro 
15 10 15 

Ser Arg Ser Pro Pro Arg Ala Pro Gly Ser Ala Gly Pro Met Pro Ala 

20 25 30 

Gly Pro Met Pro Ala Gly Arg Arg Gly Pro Ala Ala Gin Ser Ala Arg 
35 40 45 

(2) INFORMATION FOR SEQ ID N0:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:33: 

Asn Ser Ala Arg Gly Ala Cys Val Pro Arg Arg Ala Pro Pro Pro Pro 
15 10 15 

Ser Arg Ser Pro Pro Arg Ala Pro Gly Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:34: 

Ala Gly Pro Met Pro Ala Gly Pro Met Pro Ala Gly Arg Arg Gly Pro 
15 10 15 

Ala Ala Gin Ser Ala Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 35: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:35: 

Ser Pro Pro Ala Leu Lys Asp Phe Leu Phe Gin Ala Lys Tyr Gin He 

15 10 15 

Arg Tyr Arg Val Glu Asp Ser Val Asp Trp Lys Val Val Asp Asp Val 
20 25 30 

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:36: 

Ala Ala Ser Thr Pro Arg Ser Glu Arg Pro Gly Pro Gly Gly Gly Ala 
15 10 15 

Cys Glu Pro Arg Gly Gly Glu Pro Ser Ser Gly Pro Val Arg Arg 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Trp Met Gin Lys Ser His Lys Thr Arg Asn Gin Asp Glu Gly He Leu 
15 10 15 

Pro Ser Gly Arg Arg Gly Thr Ala Arg Gly Pro Ala Arg 
20 25 
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CLAIMS 
We claim: 

1. An isolated polynucleotide which encodes a 
mammalian polypeptide, said polypeptide being comprised of an 
amino acid sequence selected from the group consisting of SEQ 
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NOs : 17 - 37 
or a polypeptide which is 90%, 95% or 99% identical to said 
amino acid sequences. 

2. The isolated polynucleotide of claim 1 wherein 
said polynucleotide is a DNA sequence. 

3 . The isolated polynucleotide of claim 1 wherein 
said polynucleotide is an RNA sequence. 

4. An expression vector comprising the following 
operably linked elements: 

a transcription promoter; 

a DNA segment encoding a mammalian polypeptide, 
said polypeptide being comprised of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 
4, SEQ ID NO: 6, SEQ ID NOs : 17 - 37; and 

a transcription terminator. 

5. An isolated polypeptide said polypeptide being 
comprised of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and 
SEQ ID NOs: 17 - 31 or a polypeptide which is 90%, 95% or 99% 
identical to said amino acid sequences. 

6. A peptide or polypeptide which has the amino acid 
sequence of an epitope-bearing portion of a ZcytorS 
polypeptide. 
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7. A polypeptide of claim 6 wherein the polypeptide 
has amino acid sequence of at least 15 amino acid residues. 

8. The polypeptide of claim 7 wherein said 
polypeptide is selected from the group of polypeptide 
consisting of the amino acid sequences of SEQ ID NOs : 32-37. 

9. An antibody which specifically binds to an 
epitope-binding sequence of a ZcytorS polypeptide. 

10. An antibody of claim 9 wherein said antibody 
binds to a polypeptide comprised of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 
4, SEQ ID NO: 6, and SEQ ID NOs : 17-37. 

11. An anti- idiotypic antibody of an antibody of 
claims 9 or 10. 

12. A method for producing an antibody which binds 
to a ZcytorS polypeptide comprising inoculating an animal with 
an epitope -bearing amino acid sequence of ZcytorS polypeptide 
under conditions wherein said animal produces antibodies which 
bind to the ZcytorS polypeptide; and 

isolating said antibodies. 
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